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ABSTRACT 


The current demerit point system in use in Ontario allocates points to 
offences on the basis of the perceived seriousness of the offence. The goal 
of the work reported here was to allocate points to offences with a different 
purpose in mind. The purpose here was to use a driver’s record of 
convictions and accidents in order to predict, as well as possible, which 
drivers, based on their past record, are most likely to have an accident in the 
near future. 


A sample of 827,955 records of Ontario drivers containing information about 
age, gender, convictions, accidents, demerit points and suspensions, for 1981- 
1984 was examined. On this basis 16 new models to estimate a driver’s 
accident potential were formulated. Some models used age and gender 
information, others did not. Some models assigned different weights to each 
of the conviction categories; in other models all convictions were weighted 
equally (equivalent to 1 point per conviction). In some models accident data 
were used; in other models, not. In contrast to the current system, 
information on all types of convictions, not just moving convictions, was used. 
It turns out that these new models are better at predicting a driver’s accident 
potential than the current demerit point system. . 


The performance of the new models is described by means of two measures. 
The first measure is the number of accidents in period 2 (1983-1984) based 
on the driver’s record of convictions and accidents in period 1 (1981-1982). 
For example, if the 10,000 drivers with the most demerit points in period 1 
are selected, 1452 have accidents in period 2; if the 10,000 drivers with the 
most accidents in period 1 are selected, 1828 have accidents in period 2. 
Alternatively, if the 10,000 drivers are selected on the basis of the best new 
model, which uses age, gender, accidents and separate weights for 14 
offence categories, 2116 have accidents in period 2. 


The second measure of performance is in terms of "hits" and "false 
alarms". "Hits" are drivers who have an accident potential in excess of 4 
times the population average. For example, if the 10,000 drivers with the 
highest estimated accident potential are selected by the best model, 3757 of 
these are are expected to be "hits". "False alarms" are drivers who have 
an accident potential which is below the population average. Of the same 
10,000, one should expect 792 to be "false alarms". The best model 
identifies approximately twice as many high accident potential drivers as the 
current demerit point system. Even the simplest model, which uses total 
convictions as the only variable, predicts 50% more high accident potential 
drivers than the current system. 


ABREGE 


L'actuel systéme de points d'inaptitude en usage en Ontario donne des 
points aux infractions en fonction de la gravité qu'on leur attribue. 
Dans le travail qui nous occupe ici, il s'agissait de donner des points 
aux infractions dans un but différent, plus précisément, de se baser sur 
le dossier des condamnations et accidents des conducteurs pour prédire le 
mieux possible lesquels ont le plus de chances d'avoir un accident dans 
un avenir proche. 


On a donc examiné un échantillon de 827 955 dossiers de conducteurs 
ontariens, contenant des renseignements sur l'age, le sexe, les 
condamnations, les accidents, les points d'inaptitude et les suspensions 
de permis de 1981 4 1984. A partir de ces données, on a établi 

16 nouveaux modéles censés évaluer les chances d'accident d'un 
conducteur. Certains modéles intégraient les données sur l'age et le 
sexe, d'autres pas. Certains modéles pondéraient les diverses catégories 
de condamnations et d'autres les traitaient toutes sur le méme pied 

(un point par condamnation, quelle que soit la nature du délit). 

Certains modéles intégraient les données sur les accidents et d'autres 
pas. Contrairement au systéme actuel, ces modéles tenaient compte de 
tous les types de condamnations, et non uniquement de celles poLrtantasur 
des infractions commises par les conducteurs de véhicules en marche. I] 
S'est avéré que ces nouveaux modéles arrivent i mieux prédire les chances 
d'accident d'un conducteur que l'actuel systéme de points d'inaptitude. 


Les résultats des nouveaux modéles apparaissent dans deux mesures. La 
premiére est le nombre d'accidents pendant la période 2 (1983-1984) en 
fonction du dossier des condamnations et accidents du conducteur pendant 
la période 1 (1981-1982). Par exemple, si l'on retient les 

10 000 conducteurs qui ont eu le plus de points d'inaptitude pendant la 
période 1, on trouve que 1 452 d'entre eux ont des accidents pendant la 
période 2; si l'on retient les 10 000 conducteurs qui ont eu le plus 
d'accidents pendant la période 1, on trouve que 1 828 d'entre eux ont 
des accidents pendant la période 2. Selon l'autre méthode, si la 
sélection des 10 000 conducteurs s'effectue selon le meilleur nouveau 
modéle, basé sur l'age, le sexe, les accidents et 14 catégories 
d'infractions de poids différents, on trouve que 2 116 d'entre eux ont 
des accidents pendant la période 2. 


La deuxiéme mesure classe les conducteurs en €récidivistes> et <fausses 
alertes>. Les récidivistes sont les conducteurs qui ont des chances 
d'accident supérieures A quatre fois celles de la population moyenne. 
Par exemple, si les 10 000 conducteurs qui ont le plus de chances 
d'accident sont sélectionnés par le meilleur modéle, 3 757 d'entre eux 
sont classés comme «récidivistes>. Les fausses alertes»> sont les 
conducteurs qui ont des chances d'accident inférieures a celles de la 
population moyenne. Sur les 10 000 conducteurs cités ci-dessus, on 
devrait en trouver 792. Le meilleur modéle permet d'identifier environ 
deux fois plus de conducteurs ayant des chances élevées d'accident que 
l'actuel systéme de points d'inaptitude. Méme le modéle le plus simple, 
dont la seule variable est le nombre total de condamnations, identifie 
50 % de plus de conducteurs ayant des chances élevées d'accident que 
l'actuel systéme. 
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1.0 INTRODUCTION 


The current demerit point system in use in Ontario allocates points to 
offenses on the basis of the perceived seriousness of the offense. An 
offense is considered serious if it is associated with a relatively large 
chance of precipitating an accident. This is why a non-moving violation, such 
as not having a trailer permit, receives no points, while running a red light 
receives a lot of points. The goal of the work described in this report was 
to allocate points to offenses with a different purpose in mind. The purpose 
here was to use a driver’s record of convictions and accidents in order to 
estimate as well as possible which drivers, based on their past record, are 
most likely to have an accident in the near future. This is done by 
calculating for each driver how many accidents per year he or she is likely 
to have, on the average. 


Because a person’s “accident potential" can only be indirectly estimated (not 
directly measured) and because, mercifully, for any one driver it is rare to 
be involved in an accident, the accuracy with which we can estimate the 
accident potential of a driver is bound to be severely circumscribed. Thus, 
our aim is not only to produce for each driver: an estimate of his or her 
accident potential but also to say how accurate our estimate is. 


This is the kind of knowledge which might then be used in the determination 
of post-licensing control action. Thus, while a non-moving violation may not 
be a threat to traffic safety, such a conviction on a driver’s record may be 
an important clue about that person’s likelihood of future accidents. 


2.0 DRIVER RECORD SAMPLE 


The analysis examined driver records over a recent four-year period, January 
1981 to December, 1984. Convictions and accidents during the first two years 
of the record were used as the basis for estimating a person’s accident 
potential in the second two-year period. 


To be included in the sample, drivers had to have been licensed to drive in 
Ontario throughout the four-year period. To this end, a number of criteria 
were used in the selection of driver records. These criteria were as 
follows. First, drivers not licensed at the beginning of the study period were 
excluded. Second, drivers who might have moved out of Ontario during the 
four year period were excluded. These exclusions were achieved by only 
selecting drivers who renewed their drivers’ license during 1985, with the 
exception of those who were 16 or 17 in 1981. This exception was made 
because such a large percentage of drivers become licensed at 16 and 17, that 
the majority of drivers of this age would be renewing their licenses in 1984 
(3 years after the original licensing) and would have been excluded from the 
sample. Because drivers had to have been licensed in 1987 to be included in 
this study, no drivers on the new probationary license system were included. 


Of the 5.5 million Ontario drivers, 827,995 met these criteria and were 
included in the sample. Data extracted from the MTO driver records 
included the following information: age, gender, and for each conviction, 
type, date, and demerit points assigned. for each accident, degree of 
- severity, date, and for each suspension, type and time period. Details are 
shown in Table 1. 
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Of the 827,995 drivers, 311,691 had no convictions or accidents during 1981- 
1984. To save on storage space, these records were kept in a separate file. 
Of the 516,304 drivers who had at least one conviction and/or accident during 
1981-1984, records for approximately 400,000 were stored in an on-line file. 
The layout of this file is shown in Table 2 


The processing of this amount of data can be costly. For some purposes, 
smaller data sets are sufficient. With this in mind, three sets of driver 
records were assembled: a full set of records, and smaller samples of 
every 40th record and every 400th record. 


3.0 PREPARATION FOR ANALYSIS 


Making sense of large data sets requires careful preparation. The several 
preparatory activities are described below. 


3.1 Checks for Representativeness of the Sample 

Several tests were conducted to check whether: the characteristics of a 
smaller sample (1 in 40 records) were similar to known (published) 
characteristics of the Ontario driving population. In terms of percent of 
male and female drivers, percent of accidents involving males, and accident 
involvement per driver/year the sample data were found to be reasonably 
representative of the Ontario driving population (See Table 3). 


She Consistency Checks 


Driver record data were then checked for consistency. A number of 
inconsistencies were noted. The first inconsistency apparent in the data was 
that while some offenses are logically associated with the occurrence of an 
accident, the count of such offenses exceeded the count of related accidents. 
Thus, for example, one would expect that if a driver were convicted of 
failing to remain at the scene of an accident, he or she would also show an 
accident associated with the offense on that same day. However, this is 
clearly not so; in the small sample there were 45 such convictions, but only 


28 accidents for drivers convicted of this offense. There is a _ similar 
discrepancy for convictions for "following too close" which apparently arise 
mainly from accidents. These discrepancies suggest that many accidents 


related to convictions have not been entered in the driver record file. It is 
possible that this omission might apply to accidents in general. One reason 
for this may be due to the fact that records contain only reports of 
"reportable" accidents whereas charges apply to any. 


A second inconsistency was the considerable year-to-year fluctuation in the 
number of convictions: for certain offenses. For example, the number of 
convictions for driving without a permit increased from O in 1981 to 68 in 
1984. There were seven other conviction types with similar inconsistencies 
from year-to-year. The reason for these inconsistencies is not clear, but 
they could be related to changes in enforcement practices or to modifications 
related to regulations. 
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Year (1981=1, and so on) 
Conviction Code 
Speed Limit (for speeding convictions) 
Recorded Speed for Speeding Conviction 
Demerit Points assigned 
Code 1 for non-accident related, single conviction 
2 for non-acc. related, primary of multiple conv. 
JEeLObEnOn-OCC-—reLloLced = Secondary Of multiples” 
4&4 for accident related, single conviction 
5 for accident related, primary of multiple conv. 
Shor . * , secondary of : us 
Accident Record (Variable -- space for n2 accidents) 
Yeor (1981=1, and so on) 
Severity code 
Apparent Driver Action Code 
Driver Condition Code 
Record of demerit point adjustments (Variable record 
length -- space for n3 adjustments) 
Month # from Jan. 1981, of adjustment 
Sign of adjustment: + = points added 
Demerit Point change 
Code 1 if reinstatement after suspension 


Characteristic Sample Value Population 
value 

% male drivers 53.4% 56% 

% female drivers 46.6% 44S 


% of accidents 
involving males Teeer 72.9% 


% of accidents 
involving females 27 . 8% 27.1% 


Involvements per 
driver/year DB. 964 6.957 


PreOpoOnt1Onerocol.: & 

injury involvements %.35 G.35 

(all accidents) 
* - Population values are estimated from the Ontario MTQ 1985 
Annual Accident report and are for a comparable age distribution. 


The third inconsistency involved the original licensing date. This variable 
was found to be unreliable. Some of the dates were missing and none of the 
the original dates were earlier than 1966 although a substantial number of 
drivers were obviously licensed before that year. This meant that the effect 
of experience on a driver’s accident record could not be examined. This is a 
result of the development of the computer system in 1966 and of changes of 

residence and lapses of renewal that affect the recording of the original date. 


3.3. Accident Related Convictions, Multiple Convictions and At-Fault 
Accidents 


We anticipated that some models for the estimation of accident potential 
would make use of the driver’s accident record. To avoid double-counting, 
convictions which were the consequence of accident involvement had to be 
identified. For the purpose of later analysis, we identified a conviction as 
being accident-related if it occurred on the same date as an accident. Ina 
sample of 16,000 drivers we found 8940 convictions in two years. Of these, 
612 (7%) were for violations which the driver incurred on the same day as 
an accident. 


If several convictions were recorded on the same date, they were assumed to 
be related to a single apprehension. We think that only the "primary" 
conviction should feature in the statistical model; inclusion of secondary 
convictions could obscure the relationship between cause and effect. 


Each offense type was assigned a rank, as shown in Table 4 according to 
how likely it was to be the cause of an apprehension. Thus, for example, 
speeding was given the highest rank. For multiple convictions, the one for 
the offense with the highest rank was called the primary conviction, while 
the others were designated as secondary convictions. In case of a tie, the 
first conviction recorded was designated as the primary one. This logic is 
somewhat arbitrary, but analysis of a data sample showed that only about 4% 
of all apprehensions result in multiple convictions. In addition, some 55% of 
‘the multiple convictions involved speeding - an obvious primary offense. 


The information in Field 3C (See Table 2) - Apparent Driver Action - was 
used subsequently in the determination of whether or not an accident was "at- 
fault". All accidents with apparent driver action codes other than "driving 
properly",- were considered at-fault accidents. These accidents included some 
12% of involvements for which apparent driver action was either "not known" 
or "other". The decision to place these involvements in the "at-fault" 
category was made after discussions with police. Also, since "driving 
properly" already accounted for about 50% of involvements, it was logical to 
assume that the rest of the involvements were "at-fault". 


3.4 Exploratory Analysis 


The sample of every 40th record was used to perform some exploratory 
analyses. Figure 1 depicts the relationship between accidents and convictions 
in four years in a sample of nearly 16,000 drivers. It shows that drivers 
with more convictions tend to have more accidents. Thus, for example, the 
189 drivers with 6 convictions in four years had, on the average, eight times 
as many accidents (line 1) as drivers with no convictions in these four years. 
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TABLE 4: 


Multiple convictions (same day) in sample of 7811 drivers with ranking 
according to reason for apprehension: 


1 = most likely; 4 = unlikely 


Offence Code No. in Of fence Code No. in 
Sample Sample 
Rank 1 Foil rep. dam. to hy. prop. 12768 2 
Dang. driving CCC 7BBEB 1 
Dr. m/v, no valid plates 188354 1 Fail rem at accid. CCC 72898 6 
No plate, operate m/v 18865 3 Danger. driving CCC 76128 2 
Imp./insuff./ drvg. lamps 18428 9 Impaired driving C.C.C. 78174 = 23 
Imp. no. plate lamp 18538 6 Foil/ref. breath test CCC 79185 1 
Excess fumes/smoke 18878 2 Foil or ref. breath sam.CCC 7$212 19 
Unnecess. noise (18888 i Driving with >88 mgs.alcon. 78214 ys 
Horn/bell violation 18898 Vike chad | ate ee 5 eh ee ee eee 
Speeding -- km in --km zn. 11355 211 Ronk 3 
Coreless driving 11496 22 
Foil to stop, intersec. 11588 25 Imp. signal devices 19616 1 
Fail to yield ROW at stop 11598 2 No safe helmet, motor cy. 11996 3 
Fail yield ROW ot yield 11618 1 Fail/imp. use of Seat Belt 11897 85 
Fail to yield ROW, Pr. rd. 11628 2 Insecure load 11266 23 
PXO viol. oth. side of rd. 11658 1 Excess veh. width 11319 1 
Pass at Ped. crossing 11665 2 Excess len. ven-comb. 11328 1 
Imp. RT ot intersection 11695 1 Excess veh. height 11358 1 
Improper left turn 11713 2 Crowding driver seot 12248 1 
Imp. LT at intersection hice oS Sa GP it le aka ipa ae aati en teninth Z pete ty ye 
Improper left turn 11733 1 Rank & 
: " from parked pos. 11768 S) 
Disobey red light 11868 26 Dr. m/v,no valid permit 19833 19 
amber " 11878 2 No valid. tag on plates 18835 7 
Disob. green arrow 11988 1 Draw trailer, no permit 19937 3 
Prohibited turn 11928 86416 foil surr. m/v permit 19938 7 
Disobey red light 11925 2 ” " trl. " 19839 1 
Fail to share road 11948 5 foil not. ch. address 19958 3 
Imp. poss.- appr. traffic 12618 € Fail apply for permit 18971 2 
Driving left of center 12948 (4 Imp. plates on veh. 18136 3 
Pass on right/off r’way 12188 1 No driver's license 18196 39 
Wrong-way,one-way street 12116 2 Driv., cond. pron. 198193 2 
Imp. driving, div. h’way 12126 4 Foil to prod. dr. license 18218 45 
Disobey off.signs,div.n'’way 12146 1 No lic.» fail to IO selt 18214 1 
Fail to stop for emer/veh. 12288 1 Pos. canc/susp. license 19222 2 
Imp. park., Interf. w.traf 12438 1 Unlaw. pos. license 19328 1 
Prohibited turn 82028 1 Dye, SbKe. AVE; Ge We 10352)" 4 
By-low, pron. Turn 88118 1 Def. brakes, contr. to reg. 168686 1 
Teh era ye ati te die a ape ran ono Def./imp tires 187508 2 
Refuse exam. unsafe veh. 18989 1 
Rank 2 Foil to carry/prod.permit 13218 3 
More than 1 license 13278 1 
Imp. reor lamp on tris 19576 1 No insurance 20619 2 
View obstr., sign/poster 16889 1 Op. m/v, no insurance 29611 5 
Rear window obstructed 19858 1 Folse statem. re. insurance 26638 1 
Def/imp/no muffler 18868 4 Fail to have insurance card 26058 26 
F.T.C., motor vehicle 12186 4 n " carry " n 28068 se 
Foil to rep. accident 12718 16 " noon evid. of ins. 26879 1 
Fail remain at scene accid. 12736 18 Dealer plate infraction 98259 1 
Learner unaccompanied 99788 1 
Me " foil not. addr. ch. 99818 6 
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(Convictions which were incurred on the date of an accident were eliminated 
from consideration). The same pattern prevails for at-fault accidents 
(line 2), fatal and injury accidents (line 3) and at-fault, fatal and injury 
accidents (line 4). Figure 2 shows similar results. However, this time the 
relationship between convictions in the first two year period and the average 
number of accidents in the subsequent two-year period is plotted. 


These two figures leave no doubt that convictions and accidents are 
correlated. However, correlation should not be taken to indicate causality. It 
is, in fact, very likely that the number of accidents in which a driver is 
involved and the number of convictions which a driver incurs are both related 
to the distance driven. Thus, the observed statistical correlation reflects a 
hidden common link, exposure, about which we have no information. 


It deserves noting that about 2/3 of all drivers have no convictions in a two- 
year period. We are interested mainly in the accident potential of drivers 
who have convictions. In addition, the difficulties of computing are greatly 
reduced if only that 1/3 who had one or more convictions are used for data. 
It is for these reasons that the core of the analysis described in Section 4 
used the records of’ drivers who in the first two years had one or more 
convictions. 


3.5 Selection of Conviction Categories 


A preliminary analysis of a sample of about 8000 drivers indicated that, over 
the four-year period 1981-1984, these drivers were convicted for 
approximately 200 different traffic offenses. Speeding accounted for some 
60% of non-accident related convictions; seat-belt offenses accounted for about 
10%, and failure to stop at an intersection for about 5%. Most of the other 
offense types had very few convictions. It was obviously impractical to 
estimate how much these "leaner" convictions added to a driver’s accident — 
potential. It was therefore necessary to aggregate offenses with few 
convictions into larger groups. (It should be noted that the same approach is 
already present in the current demerit point-system. There are essentially 8 
categories of offenses, that. is, those assigned 0,1,2,3,4,5,6 and 7 demerit 
points. For example, all the non-moving violations as well as some of the 
moving ones fall into the 0 point category.) 


Determining Conviction Categories: Step 1 


The clustering of all possible offense types into a manageable number of 
groupS was accomplished in several steps. The first step was to combine 
the offense types which are quite similar in nature into a smaller number of 
categories. A total of 45 such categories were established in our discussions 
with MTO in April 1987. These consisted of 38 categories of moving 
violations, 3 for  vehicle-related offenses, and 4 for non-moving, 
administrative violations. (See Table 5.) 


This first round of clustering was based on judgment of similarity. In the 
second round, groups of offenses were identified which were similar in their 
contribution to the average number of accidents of a driver. To obtain 
results which are unambiguous and free of confounding, the only records used 
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No. of 3 yr. Weighted 95% Limits 


Category Brief Description Drivers Accs. Mean Upper Lower 
n1 Minor neglect, licenses, permits 6495 2918 6.434 8.445 9.422 
n2 Neglect,insurance,permits,etc. 1589 719 8.414 9.438 8.392 
n3 License suspended,HTA 874 454 8.424 8.456 6.394 
n& Learners 34 18 B.343 8.582 8.212 
v1 Minor veh.;lomps,noise 2954 1498 8.468 8.485 8.451 
v2 Brakes, tires,unsafe vehicle 946 451 2.4998 8.430 9.371 
v3 Comm.veh;size & weights 583 369 §.542 9.583 9.580 
m1 Seat belt 12337 4858 %.376 9.384 8.368 
m2 Speeding | 173592 55211 $2519 0.321 9e3s17 
m3 Careless driving 982 342 @.327 6.357 8.299 
m4 Slow driving 45 aa 8.119 8.237 6.855 
m8 STOP sign, ROW violations ~ 14824 3935 g.288 $2295 85281 
m9 PXO violations Use W/ 355 @.296 $.328 8.272 

mig Turns violotions;right,left,U 18231 4942 @.285 9.289 8.277 
mi1 Unsafe move;open door 1649 542 8.334 iJ meteter (heme. 
mi3 Disobey red light 13731 4278 8.313 9.321 8.386 
m14 Amber light 3453 982 @.285 6.299 $.271 
m15 Advance green 274 73 %.265 P2517 707215 
mi6 Fail to share road 178. 62 @. 303. 9.372 8.242 
mi7 .Passing violations 1395 459 9.327 O2551) B25 05 
m18 Wrong-way one way street 1582 458 8.284 G.386 8.264 
‘mi9 Improper driving divided h’way 2599 982 0.361 0.379 9.344 
moO Se reales Comme 934 337 G.344 9.374 8.316 
m21 Emerg. veh., school x’ing 48 15 6.159 9.288 9.884 
m22 R/R crossing violations oom 35 G.314 0.488 2.233 
m24 Headlight beam not lowered Zen 73 9.262 §.318 8.289 
m25 Improper parking 145 77 9.497 9.484 8.334 
M26 uno. lS tLOpet Oe ScCnOolsDuUS 624 135 §.281 9.317 8.249 
m28 Disobey traffic signs 16598 529 Sasze 0.344 8.301 
m29 Fail report accident 224 73 @.266 8.324 8.215 
m3 Fail remain at scene 236 . 75 of eM) 2.373 9.261 
m32 Dangerous driving C.C.C. eh 2 9.919 B.421 6.908 
m33 Fail remain at accident C.C.C. 66 39 @.377 0.491 8.274 
m34 Dangerous driving C.C.C. 89 46 8.281 §.376 §.262 
m35 Impaired driving C.C.C. 2381 1848 0.443 8.462 6.424 
m36 Foil/refuse breath test C.C.C. 94 37 8.218 9.396 9.149 
m37 Foil or ref. breath test C.C.C. 1208 56 6.314 0.397 6.241 
m38 Driving with >88 mgs. alcohol 3676 1582 8.386 8.491 9.371 
m41 Crowding driver seat 1208 36 2.280 @.362 8.211 
m44 Radar device in vehicle 68 27 8.251 0.359 6.166 
m45 No safe helmet, motorcycle 96 47 %.266 0.357 9.191 
m46 Fail to signal to stop 18 6 0.980 2.277 8.819 
m47 FTC, commercial vehicle 67 35 @.362 9.475 8.262 
m48 Fail to stop for police officer 12 6 2.986 G.348 8.616 
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were those for drivers who in any one of the four years had a single 
conviction. The results given in Table 5 are based on the entire sample. 
Thus, for example, there were 12337 drivers who in one year were convicted 
for not wearing a seat belt (offense ml) and had no other convictions in that 
year. During the remaining three years these drivers recorded 4858 accidents 
for an average of 0.394 accidents per driver. To eliminate any bias due to 
differences in the age-gender distribution which might be associated with 
specific offenses, all averages were recalculated for a "standard population" 
(for more detail, see Appendix 1). The "standard population" used had an 
age-gender composition which was characteristic of those drivers who had 
exactly one conviction of any kind in one of the four years. This is why in 
the "Weighted Mean" column the average number of accidents in the remaining 
three years associated with this seat belt offense is listed as 0.376. 
Similarly, the 173,592 drivers who had only a single speeding conviction (m2) 
in some year have an adjusted average of 0.319 accidents in three years. 
The last two columns give 95% confidence limit for the weighted mean. 


Categorization of Speeding Offenses: The current point allocation system 
ST SRAIe four levels of severity according to how much the speed limit is 
exceeded. However, exceeding the speed limit by 20 km/h, whether in a 40 
km/h or a 100 km/h speed zone, carries the same number of demerit points. 
Since there were many speeding convictions in the data base, the question of 
how the average number of accidents depends on the speed limit and on the 
extent to which the speed limit is exceeded was examined. : 


Figures 3 and 4 show the frequency of violations for various speeds over the 
limit. It seems clear that there is a tendency for "speeds over limit" to be 
rounded to multiples of 5 km/h. However, the largest numbers of violations 
are for 14 km/h over the speed limit. Since demerit points are given for 15 
km/h or more over the speed limit, it appears that either drivers consciously 
drive just below this number or, more likely, that there is some mitigation by 
police officers for speeds just beyond 14 km/h over the speed limit (perhaps 
to account for inaccuracies in radar gun and speedometer readings). A 
similar situation exists, though less pronounced, for 29 km/h over the limit, 
just below the 30 km/h at which demerit points are increased from 3 to 6. . 


As a result of the anomaly noted above, dividing speeding convictions for the 
purpose of determining the relative seriousness of various "speeds over limit" 
can produce misleading results. In a division based, say, on the current 
system, most convictions in the 0-14 km/h over the limit category should 
really belong to the seemingly more serious 15-20 km/h over limit category. 
Similarly, the 15-29 km/h category may contain many convictions that belong 
in the 30 km/h or more over the limit category. Thus regression weights 


for each category might wrongly indicate that the 3 categories are similar in 
terms of seriousness. 


We have examined in detail whether drivers who have in some year exactly 
one speeding conviction of given gravity differ in their average number of 
accidents during the remaining three years. The results are given in Table 
6. It turns out that there is no systematic or practically significant 
difference between the average number of accidents either by speed limit 
zone or by the amount by which the speed limit is exceeded, or by the 
combination of both criteria. 
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Speed Above No. of 3 yr. Weighted 95% Limits 


Limit Limit Drivers Accs. Mean Upper Lower 
All G-15 123224 55755 %.293 %$.296 9.291 
All 16-29 79589 22479 %.286 G.289 9.283 
All 32-49 25891 Zoo %.297 %.392 9.291 
All >49 894 371 %.315 G.344 9.286 


28-48 9-15 24264 6307 6.288 $.293 $.283 
58-78 6-15 86248 22637 $.294 $.297 9.291 
80-198 6-15 18712 5411 G.292 §.298 9.286 
20-48 16-29 5276 1413 §.281 $.293 $.278 
58-78 16-29 39995 11427 $.293 4.297 9.288 
80-198 16-29 34318 9639 .281 9.285 9.276 
29-48 38-49 683 185 6.269 6.362 $.239 
58-70 30-49 9304 2914 9.385 6.314 9.296 
88-198 38-49 13984 4232 6.292 9.300 9.285 
29-49 >49 14 8 6.186 6.432 9.964 
59-76 >49 495 183 9.341 9.386 9.299 
80-186 >49 475 186 6.272 G.312 9.236 


Determining Conviction Categories:Step 2 On the basis of such comparisons, it 
was ae to combine some of ee 45 conviction categories which were 
found to be associated with similar average numbers of accidents. The 
resulting 14 conviction groupings, the associated three-year average number of 
accidents and 95% confidence limits of the estimate are shown in Table 7 and 
plotted in Figure 5. Also shown are estimates of accident potential for 


-conviction-free drivers, those who had no convictions of any type during one 
calendar year. 


In summary, the final 14 conviction categories to be used in analysis were 
established on the basis of the following considerations: 


1 Conviction types within each category were similar in nature; 


Ph: The accident potential associated with each conviction within a category 
was Similar in size; 


3. The number of drivers having offenses in each category was 
considered sufficient to provide a reliable estimate of accident potential for 
that category. 


A list showing which convictions are included in the first 14 conviction 
categories is given in Appendix 2. 


3.6 Discussion of Conviction Categories 


Inspection of Figure 5 leads to the question of why it is that convictions for 
some of the innocuous offenses (e.g. for a burnt-out light bulb, nnl) are 
found to be associated with more accidents than offenses traditionally deemed 
very dangerous such as speeding (mm7) or running a red light (mm2). 
Several reasons combine to explain this apparent paradox. First, convictions 
associated with most accidents are those characteristic of trucks. Since 
trucks cover 10-20 times the distance of a passenger car per year, it is to 
be expected that they will have, on the average, more accidents. The second 
reason is easier to explain through an example. Assume that 1000 run-the-red 
offenses lead to 5 accidents and that 1000 fail-to-signal-turn offenses lead to 
1 accident. Thus, we are assuming that to run-the-red is a more dangerous 
offense than failing to indicate a turn. However, the enforcement of both 
offenses is unequal (perhaps because one is assumed to be more dangerous 
than the other). Assume further that of the 1000 run-the-red offenses, 10 
lead to a conviction whereas of the 1000 fail-to-signal offenses 1 leads to a 
conviction. Thus, a figure similar to Figure 5 would show 5/10 = 0.5 
accidents per conviction for running the red and 1/1 = 1.0 accident per 
conviction for failure to indicate a turn. Even though, according to our 
starting assumption, running the red is five times more dangerous than failing 
to indicate a turn, because enforcement of the two offenses is unequal, the 
final results show the contrary. The problem is caused by the fact that the 
driver record contains information about the number of convictions, not the 
number of of illegal actions committed by a driver for which he is not 
convicted. This is the data with which we have to work. A third reason 
may be related to the connection between different types of behaviour and 
convictions. The incidents on a person’s driving record, convictions and 
accidents, may be indications of their overall driving behaviour. As a result 
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the types of convictions committed by certain types of people may also 
provide insight into their potential for accidents. If a person engages in 
certain behaviours which lead to certain convictions they may also engage in 
certain other behaviours which predispose them to accidents. To illustrate 
with an example, drivers with environmental types of convictions (e.g. noisy 
muffler) were found to have a higher weighted mean of accidents than 
drivers with most other types of convictions. Most drivers quickly have their 
noisy muffler fixed and are unlikely to receive this type of conviction. The 
attitude which results in drivers coming to the attention of the police and 
being charged with this offense may be related to a similar careless attitude 
towards behaviour which results in accidents. 


All of the three reasons discussed above affect the correlation between a 
particular offense category and accidents. Thus, correlation does not 
necessarily indicate causality. Two conclusions follow. First, one should not 
interpret the results in Figure 5 as providing information about the danger 
inherent in this or that offense. Second, one should not be surprised when, - 
later, in section 4, the same innocuous offenses will prove to be strongly 
related to the driver’s estimated accident potential. 


3.7. Age and Gender Categories 


It is well known that the average number of accidents for a driver depends 
on gender and age. To account for this fact, age and gender will be used in 
the analysis as "variables". It is relatively simple to account for gender 
since it comes in two natural categories. However, the relationship between 
age and number of accidents is continuous in nature and distinctly non-linear 
as is shown in Figure 6. To include age in the analysis it was necessary to 
establish a number of age categories. After careful analysis, the boundaries 
between age groups were chosen so that the average numbers of accidents 
within each group remained nearly constant while still keeping sufficient 
numbers of drivers within each age category to maintain statistical reliability. 
The following eight age categories were chosen: 


<21 

21-25 
26-30 
31-35 
36-40 
41-50 
51-60 
>60 


3.8 Exclusion of Drivers with Suspended Licenses 

Some drivers in the sample had their license suspended during the study 
period. Most of these are drivers who had many convictions which carried 
demerit points. The extent to which a suspended driver curtails his or her 
driving is unknown. This leads to serious difficulty in the statistical 
analysis. Consider a driver who in the first two-year period had many 
convictions and was suspended. That driver can be expected to drive less in 


the second two-year period and therefore to have proportionately fewer 
accidents. 
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In the statistical analysis this would tend to create a negative correlation 
between accidents and convictions. That is, it would lead to the incorrect 
result that the larger the number of convictions in the first two years, the 
fewer accidents a person is likely to have in a subsequent period. The net 
effect of this difficulty is to distort the results of analysis in some 
unpredictable way. In fact, in the initial statistical analyses, this distortion 
was so large that negative weights were produced for criminal code offenses 
for which drivers were likely to be suspended. (The implication of a 
negative weight is the paradoxical result that the offense is associated with 
fewer subsequent accidents.) As a result of this finding, it was necessary to 
remove from the data set and from subsequent analysis those drivers whose 
license had been suspended during any time in the period 1981-1984. The 
result of this decision is that whatever results are obtained in the course of 
the analysis apply directly only to those drivers whose licenses have not been 
suspended. The extension to drivers who were suspended under the present 
demerit point system is therefore an extrapolation. 


4.0 ANALYSIS AND RESULTS 


The various activities described so far (checking representativeness, 
conducting consistency checks, the exploratory analysis of data, the selection 
of conviction categories, determination of age groups and the removal of 
suspended ATES) are all preliminary to the main activity, namely the 
establishment of a relationship between information contained in a driver’s 
record and his or her expected number of future accidents. The information 
used was a driver’s gender, age, the count of accidents (at-fault, not-at-fault 
or total), and the count of convictions in each of the fourteen categories. 
This information from the first two years was used to estimate "weights" 
which best fitted the accident record in the second two-year period. Weights 
which are associated with each conviction category can be taken to be relative 
numbers of "points" which should be assigned for the best prediction of the 
likelihood of an accident in the second two-year period. Only records of 


those drivers who had at least one conviction in the first two-year period 
were used (n 170,000). 


4.1 Schemes and Variants Examined 


The MTO was interested in a number of variants, each using different sets 
of variables. These fall into three categories: 


1) models which made use of age and gender information and 
models which did not; 


2) models which assigned points for period 1 accidents (with 
. the further distinction between at-fault, not-at-fault and total), 
and models which did not assign points for accidents; and 


3) models which assigned different numbers of points for 
each conviction category and models in which all convictions 
carried the same weight. 


In total 16 different combinations of variables from the first two-year period 
were used. For each of these, "weights" were estimated. Each of these 16 
combinations results in a prediction equation which is termed a "model". 
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Table 8 shows which variables were used in each "model". Thus, for 
example, model B2 calculates the expected number of period-two accidents 
using the count of period-one convictions in each of the 14 types (see Table 
7) as well as the count of total period-one accidents; model C4 uses driver 
age and gender, the count of all convictions (without distinction with respect 
to type), the count of at-fault accidents and the count of not-at-fault 
accidents. 


Half of the models tested (Al-A4, B1-B4) allowed for a different weight to 
be given to each of 14 conviction types. The other 8 models tested used a 
weighting of 1 point per conviction, that is, all 14 conviction types were 
considered equivalent (models C1-C4, D1-D4). Half of the models used age 
and gender as variables, to aid in the prediction of accidents in the second 
two year period (models Al-A4, C1-C4). 


One quarter of the models (Al, Bl, Cl and D1) used only information on 
convictions in the first two year period to predict accidents in the second two 
year period. One quarter of the models (A2, B2, C2 and D2) used total 
accidents in the first two year period as well as convictions, to predict 
subsequent accidents. One quarter (A3, B3, C3 and D3) used only at-fault 
accidents and convictions for prediction purposes. Finally, one quarter (A4, 
B4, C4 and D4) used separate weights for at-fault and not-at-fault accidents, 
as well as convictions iss separately weighted: A4, B4 or one point per 
conviction: C4, D4) to predict subsequent accidents. 


4.2 What is a "Model" and how does it "Work" 

By "model" we mean a mathematical equation which allows the estimation of 
a driver’s expected number of accidents in "period two" on the basis of that 
driver’s age, gender, record of convictions and record of accidents in "period 
one". Like many others, we have elected to use the linear-sum form. 


To illustrate, consider a female driver, 24 years of age who in the first 
period had two speeding convictions, one conviction for failing to yield the 
right of way (ROW), and one at-fault accident. To estimate her expected 
number of period two accidents we consider the importance of each variable. 


Variable Weight Count Contribution 
Base Driver 0.176 ~ +0.176 
For being female - 0.061 - ~  -0.061 
For being 24 yrs. - 0.039 - -0.039 
For speedg. conv. 0.027 x2 +0.054 
For ROW convic. 0.027 xl +0.027 
For accident 0.058 xl +0.058 
Estimate of expected number of period two accidents =0.215 


a3 


TABLE 8: VARIABLES USED FOR REGRESSION RUNS 
(x -- indicates variable used in run) 


Run _ Age and sex Variables Variable Accident variables 
dummy for each for totol ------=---------------- 
variables Conviction Convictions Ae oR A soisiine » Fae, Pamalie 

Group 
eae b <a pO SS 
A2 x x x 
A3 x x x 
A& x x x x 
pee ne a ene WPA 
B2 x x 
BS x x 
B4 x x x 
cho Cielh age een See 
C2 x x x 
C5 x x x 
C4 x x x x 
Dita sary ako NSslh iRaei i pana WR Chote mC 
D2 x x 
D3 x x 
D4 x x x 


24 


The "weights" in this illustrative example are for Model A2. (See Table 9.) 
Thus, we estimate that females who were 24 during 1981 and had during that 
period of time the aforementioned record of convictions and accidents, should 
be expected to have 0.215 accidents during 1983-1984 on average. 


4.3 On the Estimation of "Weights" and "Diversity" 


The main statistical work is the estimation of "weights". These were 
estimated for each model separately. For example, the weights for model A2 
are given in the "Estimated Coefficient Column" of Table 9. The magnitude 
and relative importance of the weights for model A2 is shown in Figure 7. 
A full collection of such tables for all 16 models is provided in Appendix 3. 
In Appendix 4 we give fuller technical details about the distinctive features of 
the estimation procedure and the software used. 


The model weights allow us to estimate the expected number of accidents of 
an average driver with known age, gender and record of convictions and: 
accidents. However, while this is true for an "average" person fitting such 
traits, any specific driver in this group will have her own "expected number 
of accidents". Our statistical procedure also allows us to describe the 
diversity within each group. Having obtained estimates of the "weights" for 
a model we have examined the data again in order to establish the standard 
deviation of what we estimate to be the expected number of accidents for a 
driver with given traits and record. It turns out that the estimate of standard 
deviation is equal to the estimate of expected number of accidents multiplied 
by a constant. The list of "constants" for the 16 models is given below. 


Model Constant 


Al 0.67 
A2 0.64 
AS mae 00 
A4_ 0.64 
Bl Ort 
B2aea.05 
BS aee-62 
B4 ss 0..68 
Cl 0.67 
2m 0.05 
C3 0..67 
C4 0.65 
D1 0.72 
D2 ~=s0..69 
DomeemO ss 1 
0.68 
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Thus, for example, the standard deviation of “expected number of accidents" 
for females of the aforementioned group is estimated to be 0.215x0.64 = 
0.138. The "constants" above are an index of model performance. The 
larger the constant, the larger the standard deviation, and the less accurately 
we know the accident potential of a certain driver. The ability to describe 
the diversity of "accident potential" within specific groups of drivers will 
prove important when we analyze the performance of the current demerit 
point scheme and compare it to the performance of other "models". 


4.4 The Accident Potential of Ontario Drivers 

The models estimate, for each driver, the "number of accidents he or she is 
expected to have per year". This is not an integer because its nature is that 
of a "long-term-average". For brevity this number is called a driver’s 
accident potential. Of course, not all drivers have the same _ accident 
potential; some drive more, some drive less, some take risks, others are 
more cautious. Prior to examining the results for each model, let us examine 
the diversity of accident potential in the population of Ontario drivers. This 
will reveal how many drivers there are in the population who have a high 
accident potential. How many of these "high accident potential” drivers will 
indeed be identified for post-licensing-control under the current demerit point 
system and.the new weighting models, will be examined later. 


The number of accidents in the second two-year period was used to compute 
the mean accident potential (0.055 accidents/year) and standard deviation of 
accident potential (0.055 accidents/year) in the total driver record sample. 
Details of the method are given in Appendix 4.3. This information was then 
used to plot the distribution of accident potential in a population of 5 million 
Ontario drivers shown in Figure 8. 


Using Figure 8 one can estimate how many drivers in the population are at 
each level of accident potential. Thus, for example, it can be seen that about 
1.75 million drivers, or 1/3 of the drivers, have an accident potential which 
is higher than the average. (It is less than 1/2 of the population because 
there are some drivers with a very high accident potential). Let us now find 
out how many drivers have an accident potential which is much higher than 
the mean for this population. Consider drivers whose accident potential is 3 
standard deviations above the mean or higher, that is, an accident potential 
larger than 0.22 accidents/year. From Figure 8 it can be seen that almost 
90,000 drivers have an accident potential of 0.22 or higher. 


It is drivers with a high accident potential who are to be identified for post- 
licensing-control. However, it must be noted that a driver may have a high 
accident potential because of bad driving as well as because of high exposure 
to risk. .If a driver is likely to have x accidents driving y kilometers, the 
same driver is likely to have 2x accidents driving 2y kilometers. Thus, it is 
very important to ascertain whether one has identified, through the demerit 
point system, a driver who drives little but has a high accident potential or 
one who drives a lot and has a correspondingly high accident potential. The 
two drivers may require different treatments. For brevity we will denote 
accident potential by "m". 
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Ordinarily one is interested in the identification of drivers with unusually 
high m’s. Figures 9-11 are successive enlargements of the right-hand tail of 
the curve in Figure 8. The reader must keep in mind the fact that the shape 
of the curve reflects the assumption that the distribution of m’s is Gamma. 
We cannot be sure that this assumption is valid until we show that we can 
reproduce the actual accident data by using it. For example, if accidents 
were distributed in a bimodal fashion because people drove either a little or a 
great deal, and had either very few or a lot of accidents, then our assumption 
of a Gamma distribution would be incorrect and we would not be able to 
reproduce what is observed in terms of accident distribution by using the 
assumption. However, if accident data is distributed in a unimodal fashion 
then our experience is that the gamma distribution is a good approximation. 


In Appendix 4 (Tables A4.1 through A4.25), we show the results of the check 
on the validity of our assumption of a Gamma distribution. For example 
males aged 21-25 years (Table 4.12) have a mean number of accidents in 
period one of 0.2049, and a variance of 0.2231. Using the Gamma 
distribution, with this mean and variance results in predictions which closely 
match what has been observed. Thus, for example, for drivers with 0 
accidents in period one, we predict that there will be 45249.8 drivers who 
have Q accidents in period two. The observed number of such drivers was 
45054. The similarity between observation and prediction is consistent for 


the various age and sex groups. This indicates that the Gamma assumption is 
reasonable. 


The information contained in Figures 8-11 enables one to form an idea about 
the number of drivers with unusually high m’s in the population. The 
problem is, of course, that these drivers are difficult to identify on the basis 
of their conviction and accident record. A driver may have a relatively low 
m while accumulating a large number of convictions and even accidents in a 
certain period of time. Conversely, a person who, in a very long period of 
time, would be recognized as having an unusually high m, might in a one or 
two year period have no accidents and no convictions. The models obtained in 
Section 4 can be thought of as a net; their purpose is to fish out those 
drivers who have an unusually large m. However, even the best models 
formulated in Section 4 are but an imperfect net for the identification of 
high-m drivers. Many such drivers will slip through its holes; many of those 
identified will not be high-m drivers. The degree of success and failure of 
the net will be examined next. 


RSS The performance of the current demerit point system and of the new 


ee ie 


models ae 


Since the 16 new models were derived using appropriate statistical methods, 
rather than weighting each offense according to its perceived seriousness, 
they should perform better than the current demerit point system. However, 
all models face the same inherent difficulties as the current demerit point 
system. Namely, because of the randomness inherent in the process of 
accident occurrence and the randomness inherent in the process by which 
drivers acquire convictions, a two year record is just too short to accurately 
estimate a driver’s accident potential. As will be seen, the new models are 
an improvement on the current system, but like the current system, still fail 
to detect most of the high accident potential drivers. 
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Two measures of performance will be used to judge the quality of a model. 
The first is the number of period two accidents which have occurred to 
10,000 drivers selected as worst by a model. The model which identifies 
drivers with most period two accidents will be judged the best. 


The second measure of performance gives more detailed information. It tells 
us how many of the drivers estimated to have a high accident potential on the 
basis of their period one record are truly high accident potential drivers. 
These are the "hits". We also learn how many of these drivers have a true 
accident potential which is lower than the population average. These are the 
"false alarms". 


The first measure of performance is straightforward. Consider, for 
example, the 10,000 drivers who. in the first two year period had most 
demerit points. Checking the accident records of the same drivers, we find 
that during the second two year period they had 1452 accidents per year (see 
sum of entries in last row of Table 10). 


Consider now another group of 10,000; this time those who who in the first 
two year period had the most accidents. For this second group we find that 
they recorded in the subsequent two year period 1828 accidents per year (1828 
is the sum of the first three entries for accidents, that is, 312 in the first 
1000, 756 in-the next 4000, and 760 in the next 5000). Evidently, it is better 
to identify drivers by their previous accident record than by previous demerit 
points. Imagine now that a third group of 10,000 drivers is identified, this 
time those for whom model A4 estimates the highest accident potential on the 
basis of their age, gender, as well as convictions and accidents in the first 
two year period. This group has 2084 accidents per year in the second 
period. Thus,. selection by model A4 gives a richer catch than either 
selection by previous accidents or by the current demerit points. In 
interpreting these results one has to keep in mind that the count of accidents 
is always subject to random fluctuations. 


Several conclusions emerge. First, one can do better than to use the current 
demerit point weights. Second, it is important to make use of the driver’s 
record of accidents. This is already evident from the comparisons of the 
last two rows. However, it also emerges from the poor performance of 
models Al, Bl, Cl and D1 which do not make use of accident data. In fact, 
the top 1000 drivers can be well identified by their previous accident record 
alone. Third, the more drivers that are identified, the lesser the "yield". 
Thus, the top 1000 drivers have an accident rate of ~0.3 accidents/year which 
is approximately 6 times the population average; for the first 10,000 drivers 
the average accident rate is ~0.2 and so on. 


The first measure of performance, examined above, leaves the impression 
that the drivers which are identified indeed have an accident potential which 
is substantially larger than that for the population of all drivers. While this 
is true for the group "on the average", this group ‘itself may not be a 
homogenous one. The second measure of ‘performance by which the quality 
of the alternative models is to be judged relates to the diversity of accident 
potential within the group of drivers which these models identify. 
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Table 10 


ia aa a a a er ee mS a a ew a oe 


a aa a a a ra a a a Se a a ws a ree a a a soe ee 


Model . Drivers estimated by model to be in 
Top 1,000 Next 4,000 Next 5,000 Next 10,000 Next 100,000 Total 


a a a a a ee ee ek ee 


Al 188 als 904 1660 13308 POvs 2 
A2 324 85 936 woe 14192 18016 
Az . 276 736 8&0 Ae: 13700 17284 
A4 ee 2 8468 B96 1736 14060 17880 
Bl EBs 704 824 1548 12888 16176 
B2 a2 ire 980 1628 12688 17448 
BS 7 2 748 B04 bile Ime 56 16872 
B4 304 876 ct 1636 12684 17456 
La 208 744 760 Le Tea aa ee: 16220 
C2 ee B08 900 1672 TS 72 17468 
CS PR | 780 780 1424 12684 16944 
C4 oo BO4 ae 1616 13748 17452 
ug ® 176 748 688 1432 12250 15504 
-D2 Stale B24 - 784 1608 15258 16836 
DBs 244 To 852 1360 13084 16296 
D4 564 840 788 1574 Eee Re 16784 
Accs.% EA 736 769 1432 10780 14040 
DP xk 150 640 632 Not Available 

¥ - Drivers with the Propece accident counts in period 1 were selected 


xk - Drivers with the highest demerit pcints acquired in period 1 
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Second Performance Measure: Identifying Hits and False Alarms: A 
weighting scheme is like a net with which we attempt to catch drivers 
who, based on their two year record, are likely to have an unusually high 
number of accidents in the next two years. For the illustration here, 
consider "unusually high" to be 3 standard deviations above the mean. That 
is, we are hoping to identify drivers whose accident potential is larger than 
0.22 accidents per year. If we manage to identify such a driver we will call 
this a "hit". Conversely, if based on the two year record we identify, and 
call in for treatment, a driver whose accident potential is lower than what 
is the average accident potential in the population, that is 0.05 accidents per 
year, we will call this a "false alarm".. 


Determining the hits and false alarms for each weighting scheme, or model, 
is a lengthy and rather complex procedure. It is not simply a matter of 
counting how many drivers had accidents in the second two year period, and 
how many did not. An.explanation of the process, in general terms, follows. 
Let us consider one driver, out of a sample of 827,995, Mr. Smith. Mr. 
Smith’s record of convictions and accidents in the first two year period is 
used to estimate his expected number of accidents per year in a subsequent 
time period, using one. of the point schemes or models that have been 
developed, for example, A4. (See section 4.2 for details about the estimation 
of expected numbers of accidents using a driver’s record and model A2). 
This process is then repeated for all other drivers. Now let us select those 
1000 drivers who are estimated (say, using model A4), to have the highest 
expected number of accidents. (In the current demerit point system this is 
equivalent to selecting the 1000 drivers who have the highest number of 
demerit points). Let us suppose that Mr. Smith is among these. 


Among the 1000 drivers there will be other drivers of the same age group, 
sex, and record as Mr. Smith. These drivers, though similar to Mr. Smith 
in these three characteristics, will drive different amounts, in different- 
driving styles and are expected to have different numbers of accidents. For 
example, some who are in fact good drivers may have been in accidents or 
received convictions as a result of acts they seldom commit. These are 
expected to have few accidents in a subsequent period. Others in this group 
are in fact poor drivers and are expected to have a large number of accidents 
in the subsequent period. Thus the population of drivers who are in the same 
age group, sex and driving record as Mr. Smith in the first two year period, 
in fact have very different expected numbers of accidents. However, to the 
model, these people appear to be the same and thus are assigned the same 
estimated expected number of accidents. 


The model reflects the diversity of real accident potential in this group, by 
not only telling us the average expected number of accidents, but also the 
variance of expected number of accidents in this group. Thus for Mr. Smith 
and drivers of the same age group, sex and record there is a distribution of 
expected number of accidents which is estimated by our model. As shown in 
Figure 12 for this group of drivers, with an average expected number of 
accidents of 0.25, 60% in reality have an expected number of accidents of 0.22 
(hits) or more while 2% at the other end of the distribution have in reality an 
expected number of accidents of 0.055 or less (false alarms). 
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Figure 12 
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Note: These graphs are for the purpose of illustration only and are not meant to 
accurately represent particular data sets. 


ay! 


‘Now consider another driver, Ms. Jones, who belongs to a different group of 
drivers amongst our population of 1000 drivers with the worst records. In 
this group the average expected number of accidents is different from Mr. 
Smith’s group (though it is still high), and the curve of the distribution of 
actual expected numbers of accidents is different (See Figure 12). Here it 
may be that 4% of the drivers have a low expected number of accidents of 
0.055 or less (we call these false alarms), while 40% have an expected 
number of accidents of 0.22 or more (which we call hits). 


In order to determine the total number of hits (drivers with an expected 
number of accidents of 0.22 or more) in the population of 1000 drivers with 
the worst records, we add the numbers of drivers in each of the various 
groups (the Mr. Smith group, the Ms. Jones group etc.) who would have more 
than 0.22 expected accidents. Let us assume there are 20 drivers in Ms. 
Jones group, and 50 drivers in Mr. Smith’s group. In the case of Mr. Smith 
group there will be 60% of 20 drivers, or 12 hits. In the case of the Ms. . 
Jones group there will be 40% of 20 drivers, or 8 hits. By adding up the hits 
we obtain the numbers in Table 11, that is 528 hits in the 1000 worst records 
selected by model A4, 1568 hits in the next 4000 worst records etc. 
Similarly, we can add up the false alarms in each of our distributions. 


For illustration purposes, models A4 and B4 were selected. (The variables 
which these use to estimate accident potential are shown in Table 8.) 
Drivers were ranked in terms of accident potential estimated by each model, 
based on their record during the first two-year period. Table 11 shows the 
hits (drivers correctly determined to have accident potential larger than 0.22 
accidents/year) for the 10,000 drivers from a population of 5 million Ontario 
drivers with the highest accident potential as estimated by each model. 


As Table 11 shows, in a group of the worst 1000 drivers as_ selected by 
model A4, there are 528 hits and 39 false alarms. When drivers are selected 
by model B4, the number of hits increases to 541 but the number of false 
alarms correspondingly increases to 45. Thus, there is not much to 
differentiate the two models for the top 1000 drivers, probably because they 
consist largely of the same people. In the second 10,000 worst driver 
records, A4 gives 2679 hits and 933 false alarms while B4 gives 2657 hits 
and 1110 false alarms. Thus, A4 is not much better than B4 in terms of 
identifying the hits but is substantially better in terms of calling in fewer 
eh which are better than average; A4 tends to have fewer false alarms 
than 


Note that the yield of hits is above 50% in the first 1000 drivers and only 
~15% in drivers ranking between 20,000 and 120,000. Similarly, there are 
only 3-4% false alarms in the top 1000 drivers and this increases to 14-17% 
in the last row. Thus the larger the group identified, the worse the 
performance of the net. 


Now let us consider another aspect of how our net performs. How many of 
the drivers in the population who have an unusually high accident potential will 
slip through the holes? Earlier we showed that there are drivers with bad 
records who belong to a group with a high average accident potential, but who 
themselves belong to the end of the distribution where the probability of an 
accident is low. Similarly there are many drivers with average or good 
driving records who belong to groups with low average accident potential but 


38 


Drivers estimated Number of drivers expected to have 
by model to be in ma 0222 m< $9.95 
Model A& Model B4& Model A& Model B4& 


the top 1,080 528 541 39 45 
the next 4 , BOD 1568 1595 246 289 
the next 5,820 1661 162¢ 399 458 
the next 19,820 2679 2557 933 111@ 
the next 192,880 15987 15291 14198 16928 
TOTALS 128,980 22363 21724 15896 1883¢ 
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who themselves are in the upper end of the distribution and have a high 
accident potential. Such individuals will slip through the net. 


Looking again at Figure 8, it can be seen that out of 5 euition drivers, 90,000 
drivers have an accident potential larger than 0.22 accidents/year. Thus, as 
Table 11 shows, using model A4 to select the 10,000 drivers with the worst 
records will catch 3697 of the high accident potential drivers; calling in the 
next 10,000 will identify 2679 more hits. Calling in the next 100,000 will 
yield another 15,987 hits. Thus even after those 120,000 drivers of 5 million 
who have the highest accident potential according to model A4 have been 
selected for treatment, only 22,363 hits can be expected. Of the 90,000 
drivers in the population who have an unusually high accident potential (>0.22), 
still 67,637 remain unidentified. One would have to call in the whole driver 
population before all the hits would be identified. 


Table 12 compares performance among the 16 models, and the current. 
demerit point system, in terms of hits and false alarms for the worst 10,000 
drivers identified by each model. It should be noted that, while there will 
always be considerable overlap between groups of drivers identified by 
different models, there also will be systematic differences. . Thus, for 
example, the use of model series A and B will lead to groups which contain 
more truck drivers than the current system, simply because the current system 
does not assign any points for truck weight or truck dimension offenses, 
while models A and B weigh these heavily. 


While comparison in terms of hits and false alarms is good for purposes of 
illustration, it depends on a rather arbitrary definition of what is to be 
considered an "unusually high" accident potential. A more comprehensive way — 
to characterize the performance of different models is by continuous curves, 
as shown in Figure 13. 


In Figure 13, accident potential is measured on the horizontal axis. The - 
lowest curve represents the current demerit point scheme. In a group of 
10,000 drivers who in a population of 5 million have the most demerit points, 
one can expect to find 2800 who have an accident potential above 0.2 accidents 
per year. The highest curves represent models A4 and B4. In a group of 
10,000 drivers who in a population of 5 million have the highest estimated 
accident potential by model A4, one can expect to find some 4200 drivers who 
have an accident potential above 0.2 accidents per year. Thus, the higher the 
curve, the better the net. 


While the current demerit point scheme is clearly inferior to the new 
schemes, there is not much to distinguish between the other four models in 
Figure 13. A4 and B4 give a separate weight to each of 14 convictions and 
are somewhat better than C4 and D4 which assign the same weight to all 
convictions. A4 (which uses age and gender information) is slightly better 
than B4 where accident potential is less than 0.15 accidents/year. B4 is 
slightly better than A4 for accident potential greater than 0.15 accidents/year. 


In Figure 14 we show a second group of 10,000 drivers, those who rank in 
the 20,000 to 30,000 worst records. That is, these drivers’ records are bad, 
but not as bad as the 10,000 shown in Figure 13. The difference in the 
performance of the four models appears to be even less important. 
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TABLE 12 : Figures of merit for 18,868 drivers with highest m’s 
(for each model ) 


Model Number of drivers expected to have 
ah BAG), Fares m< 2.95 
Al 3258 988 
A2 3691 676 
A3 3449 817 
A4& 3698 674 
B1 Soon! 1%62 
B2 3756 86 
B3 weiter he) 923 
B4& 5157, 792 
C1 2911 1224 
C2 3411 756 
C3 S147 G22 
C4 3429 Vig! 
D1 2978 he 1a 
D2 3441 989 
D3 5155 1191 
D4 3451 9%6 
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5.0 SUMMARY AND CONCLUSIONS 


The purpose of this work was to use a driver’s record of convictions and 
accidents in order to predict, as well as possible, which drivers, based on 
their past record, are most likely to have an accident in the near future. 


Driver Record Sample. A sample of 827,995 records of drivers licensed to 
drive in Ontario San 1981-1985 has been examined. Each driver record 
contained information about the driver’s gender, age, and details of accidents, 
convictions, demerit points and suspensions. 


Preparatory Analysis. In preparation for analysis, the many hundreds of 
offense types had to be grouped into a manageable number of categories. We 
first put in the same group offenses which were similar in nature and then 
consolidated those offenses which were associated with a similar average 
number of accidents. It turned out that, for example, drivers who in one year 
had a single speeding conviction had fewer accidents in the remaining three 
years than other drivers who had a single conviction in that year for a 
relatively minor offense such as a missing lamp. This finding may be 
initially puzzling but, on reflection, helps to interpret later results correctly. 
It arises partly because not all illegal behaviours lead to convictions at the 
same rate, partly because some offenses are specific to truck drivers who 
drive 10-20 times as much as car drivers and tend to have proportionately 
more accidents, and partly because behaviour which results in a fairly 
innocuous offense such as a noisy muffler may be of the type which also 
leads to accidents. Therefore neither the ratio of accidents to convictions 
nor the "weights" which are later attached to particular offenses are an 
indication of the gravity of that offense. 


Drivers whose license is suspended will curtail their driving to some extent. 
This is why, during the period of suspension, one should expect some 
reduction in the number of accidents in which they are involved. However, the 
extent to which driving is curtailed is unknown. To assume that suspended 
drivers stop driving would introduce a bias into the analysis; assuming that 
they continue to drive would cause another bias. To protect the integrity of 
the results, drivers who were suspended had to be removed from _ the 
analysis. Therefore, our conclusions apply directly only to drivers who have 
not been suspended. The extension to drivers who were suspended under the 
present demerit point system is therefore an extrapolation. 


Alternate MS sieie, Schemes. We have examined 16 models to estimate the 
expected number of future accidents for a driver based on age, gender, 
convictions and accidents. The models differ from each other in the 
information which is used. Some make use of age and gender, others do not; 
in some, each of 14 types of convictions is given a different weight, in 
others all convictions have the same weight; in some, at-fault accidents are 
counted separately from not-at-fault accidents, in some they are lumped 
together. All 16 models have a common structure, that of a "weighed sum" of 
convictions, and in some models, accidents. The basis is always a 17-20 year 
old male with no accidents or convictions in the first two years. Thus, in one 
of the models, these drivers are expected to have 0.176 accidents in the 
second two year period. For being female one subtracts 0.061, for being 24 
years old one subtracts 0.039, for having two speeding convictions one adds 2 
x 0.027 for having an accident one adds 0.058, and so on. The result is an 
estimate of the accident potential of a driver with specific age, gender and 
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record of convictions and accidents. We also estimate the distribution of 
accident potential in the population of drivers with given traits. We make 
use of this distribution to describe the performance of alternative models. 


Measures of Performance. Two measures of performance were used to 
judge the quality of a model or weighting scheme. The first measure of 
performance is the number of period two accidents in a group of drivers of 
fixed size as identified by each model or weighting scheme on the basis of 
their first period record. The second measure of performance is the number 
of "hits" and the number of "false alarms" expected to occur in groups of 
fixed size as selected by the alternative models. 


Comparison of the Current Demerit Point System and Alternative Schemes. By 
the first measure of performance we find that those 10,000 drivers who in 
the first two year period had most demerit points, recorded during the second 
two year period 1452 accidents per year. Those 10,000 drivers who in the 
first two year period had the most accidents, recorded in the subsequent two 
year period 1828 accidents per year. A third group of 10,000 drivers, those 
who by model A2 had the highest accident potential when calculated on the 
basis of first period data, recorded 2116 accidents per year in the second 
period. Thus, selection by model A2 gives a richer catch than either 
selection by previous accidents or by the current demerit points. On this 
score, model A2 seems to perform best. 


Several conclusions emerge. First, one can do better than to use the current 
demerit point weights. Second, it is important to make use of the driver’s 
record of accidents. Third, the more drivers that are identified, the lesser 
the "yield". Thus, the top 1000 drivers have an accident rate of ~0.3 
accidents/year which is approximately 6 times the population average; for the 
first 10,000 drivers the average accident rate is ~ 0.2 and so on. 


We now turn to the examination of the second measure of performance. Not 
all drivers have the same expected number of accidents per year. Some drive 
more, some less, many drivers are prudent, some take unwise risks. Based 
on the accident data we show how many drivers in Ontario have what 
expected number of accidents. Thus, for example, of 5 million drivers, some 
90,000 have an expected number of accidents three standard deviations above 
the average for the population. It is these "high-accident-potential" drivers 
which a demerit point scheme aims to identify. A two year record of 
convictions and accidents is just too short to estimate a driver’s expected 
number of accidents with accuracy. This is because a given two year 
period may not truly reflect a driver’s typical driving behaviour. This is 
why some of those identified by the model as having the highest expected 
number of accidents, turn out in reality to be just average drivers. 
Conversely, this is why most high accident potential drivers may not have, in 
two years, a record. which identifies them as_ such. We showed, for 
example, that of the 10,000 drivers who by the "richest" model (A4) were 
estimated to have the highest expected number of accidents, 3698 (Table 12) 
have an accident potential in excess of three standard deviations above the 
mean for the population. These are the hits. At the same time, 674 of those 
10,000 were average drivers or better. Were we to identify a different 
group of 10,000 drivers, those who in a two year period had the most demerit 
_ points (using the current scheme), we would have found 1783 hits (Table 11), 
instead of 3698 by model A4, and 1465 false alarms, instead of 674 by model 
A4. Thus, the model A4 is considerably better than the current demerit point 
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instead of 3698 by model A4, and 1465 false alarms, instead of 674 by model 
A4. Thus, the model A4 is considerably better than the current demerit point 
scheme. It identifies more high-accident-potential drivers and fewer drivers 
who are better-than-average. 


Since A4 uses age and gender data, it is not fair to compare it to the current 
demerit point scheme. A fair comparison is with model Bl (no age, gender 
or accident information used) or B2 (accident information used but no age or 
gender). The performance of these three models for the 10,000 drivers with 
highest estimated accident potential is shown below: 


Hits False Alarms 
>0.22 <0.055 acc./yr. 


Current Demerit Points 1783 146 
Bl 


B331 1062 
B2 3750 806 


Comparison of Equal Point Schemes and Separate Weight Schemes. Finally it 
may be of interest to note that little is gained by giving different numbers 
of points to different offenses. Model Dl uses simply one-point-per- 
conviction and no accident data; model D2 uses one-point-per-conviction and 
1.88 points per accident. Model D2 is only slightly worse than model B2 


which assigns different numbers of points to each of fourteen offense 
classes. 


Hits False Alarms 
>0.22 <0.055 acc./yr. 
urrent Demerit Fouints 5 
D1 2978 1211 
D2 3441 909 


In summary, if the purpose of a demerit point system is to identify drivers 
who are most likely to have an accident, the scheme used now is not as 
efficient as alternative schemes would be. Even by giving equal weights to all 
convictions and a weight of ~2 to an accident (D2), one can do much better. 
It is important to use data about accident involvement, but it does not pay to 
differentiate between at-fault and not-at-fault accidents. 


For the top 5000 (or so) drivers, it makes little difference | whether 
separate weights are assigned to different offenses. However, for the next 
100,000 -drivers, separate weights improve performance (in terms of 
predicting the number of future accidents) by a few percentage points. 
Separate weights help to increase the number of hits and to reduce the 
number of false alarms by some 10%. 


Comparison of Schemes Using Age and Gender and Schemes That Do Not. 
For the top 5000 or so drivers, rhemuchinian of age and gender information 
appears to be unimportant. For the next 100,000 drivers consideration of age 
and gender improves performance (in terms of predicting future accidents), 
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Limitations of any Demerit Point Scheme. With all this, one has to keep in 
mind that eae a few drivers are identified (say about 10,000) 30-40% of 
those will be genuine high-accident-potential drivers and only 6-10% will be 
falsely identified better-than-average drivers. However, only 3% of all high- 
accident-potential drivers in the population will be in this group of 10,000 
drivers. It does not help much to increase the size of the group because 
performance deteriorates with size. Thus, in a group of 120,000 drivers only 
~19% are genuinely high-accident-potential whereas 13-16% are falsely 
identified. Even when as many as 120,000 drivers are identified by the 
richest model, only 22,363 of the 90,000 "high-accident-potential" drivers 
are caught in the net. 


Further Improvements. The performance of models for the estimation of a 
person’s accident potential can be further improved. We think that 
consideration should be given to a system which nearly continuously tracks a 
person’s accident potential. If during a certain period of time (measured in 
weeks) the driver did not acquire a conviction and was not involved in an 
accident, his or her estimated accident potential would be revised downward. 
If during that period of time, convictions or accidents were recorded, the 
estimated accident potential would be correspondingly revised upward. A 
person’s aging, the general accident trend and seasonal variation would also be 
reflected in these revisions. In this manner, a person’s current estimated 
accident potential could be made a reflection of his or her entire driving 
history. In such a scheme, there is no need to specify an arbitrary period of 
time after which points are forgiven. 


In the models developed so far, involvement in an accident adds a fixed 
amount to a driver’s accident potential. Under the newly suggested scheme, 
an accident for a person with an already high accident potential would be 
weighted more heavily. In general a "revision" scheme of this nature relies. 


on solid mathematical logic and is expected to perform better than other 
possible schemes. 
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APPENDIX 1 


Recalculation of the Estimated m Associated With Each Offense Type 
According to a Standardized Age-Gender Distribution 


Recalculation of the Estimated m Associated With Each Offense Type 
ccording to a Standardized Age-Gender Distribution 


The clustering of all possible offense types into a manageable number of 
groups was accomplished in several steps. The first step was to combine 
the offense types which are quite similar in nature into a smaller number of 
categories. A total of 45 such categories were established in our discussions 
with MTO in April 1987. These consisted of 38 categories of moving 
violations,. 3 for vehicle related offenses, and 4 for non-moving, 
administrative violations. 


This number of categories was still considered too large for practical 
purposes. In addition it was expected that some categories would still contain 
too few convictions. Thus an attempt was made to further group similar 
categories. This time, the similarity or difference of convictions was 
indicated by the contribution of each conviction in explaining a driver’s 
accident record. Some re-shuffling of convictions within the original 
categories was also done on this basis. 


The object of the exercise was to identify drivers with one and only one 
conviction of a given type (or category) in one calendar year, and for each 
conviction type, obtain the mean (m) and the [var(m)] of accidents in the 
other 3 years. These estimates were adjusted so that the age-sex distribution 
of drivers in each conviction class was the same as that for all drivers who 
had one conviction (of any type) in any of the 4 years 1981-1984. 


Step 1: Drivers with one and only one conviction (of any type) in any year 
were identified and grouped according to age and sex. This was designated 
the base population. The proportion of drivers in each age-sex group was 
then calculated. Table Al.1 shows, for example, that 8154 such drivers were 
males aged 30 at the time of the conviction, accounting for 0.0323 of the base 
population. The number 0.0323 is the weight that was applied in step 2. 


Step 2: Drivers with one and only one conviction (of a given type) in any 
year were identified and grouped according to age and sex. For each group, 
the proportion and number of drivers, the number and sum of squares of 
accidents in the other 3 years was calculated. For example, Table A1.2 
shows that of the 8154 30-year old male drivers (Table Al.1) having only one 
conviction in any year, 332 had running red light convictions. These 332 
drivers represent 0.02595 of the sub-population of drivers whose one 
conviction in a year was Type ml13, and had 99 accidents (Sum of squares = 
149) in the other 3 years. | 


Step 3: For a given conviction type the number of accidents and sum of 
squares for each age-sex group were weighted and aggregated as follows. 
For an age-sex group, the ratio of the population weight to the sub-population 
weight was applied to number of accidents and sum of squares for the group. 
Thus, for 30 year old males, the ratio is 0.323/0.2595 = 1.244 for "run red 
light" convictions, shown in column 4 of Table Al.2; the number of accidents 
(99) and the sum of squares (149) are "factored" by 1.244. The factored 
numbers for each age group are summed and the sums are divided by sub- 
- population size to get the (weighted) mean values for the sub-population. 
Table Al.3 shows that of all drivers in the "base" population, 13731 had "run 
red light" convictions and had 4270 accidents in 3 years. The method 


described above gives a weighted mean of 0.313 accidents per driver [E(m)] 
in this class. 


Step 4: 95% confidence limits for the estimates of E(m) were 
calculated. These are also shown in Table Al1.3. 


Step 5: Steps 2 to 4 were repeated for speeding convictions placed into 
categories according to the following guidelines: 


(a) Reflecting the current demerit system, drivers were grouped regardless 
of speed limit according to whether they had exceeded the speed limit by less 
than 16 km/h, 16-29 km/h, 30-49 km/h or more than 49 km/h. 


(b) For drivers grouped according to (a), sub-groups were identified for 
speed-limit groups of 20-40 km/h, 50-60 km/h and 80-100 km/h. 


Table Al.4 provides similar information to Table Al.3 but for the various 
types of speeding convictions. 
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Sum of 


Sum of 
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. BG6369 
. 8617946 
830525 
. 839716 
844541 

. 945748 
- 944541 

942516 
- 849586 
. 938615 
. 836747 
. 935813 
- 833894 
- 832274 
. 931249 
. 27758 
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829357 
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. 9148996 
. 813363 
. 611819 
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9.901168 46 
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Proportion Proportion 
For Relative Sum For Relative Sum 
Age No. class COs0. le ACCS an 5G sae NOs CLOSS to all ACCS aod. 
17 84 6.88657 §.961 21 21 15 $.98117 $.878 4 6 
18 247 $.81931 $.929 SIS at 2 1 66 $.88516 6.737 ihe) 25 
19 422 6.83299 8.925 159 244 121 6.89946 8.768 24 se 
20 497 $§.83885 1.822 183 293 138 $.91916 $.916 28 34 
21 571 6.94463 8.998 234 418 167 6.81385 %.839 27 35 
22 546 §.64268 1.872 195 295 262 §.81579 $.722 51 39 
Ped Sige Ooo Gl. to Jamel OOm mel la 1499 920.1165 091-854 1 Wd 
24 492 §.93846 1.185 163 329 161 6.81259 $.912 18 26 
25 436 $.93488 1.191 128 163 159 $.91243 $.915 3D 36 
26 428 6.83346 1.154 143 244 142 §.91118 9.978 16 18 
27 374 $.82923 1.257 185 196 138 $.81979 $.98¢ 2 25 
28 375 8.82931 1.194 88 178 124 §.8$969 1.855 22 26 
29 328 $.82564 1.291 182 162 129 8.681988 1.948 24 34 
39 D5 eee 02595. 901. 244 99 149 143 $.81118 6.916 25 31 
=| Soe eOehu/ ie eds We 7 San 2 BOOS) Samet he, 16 16 
a2 298 9.82329 1.192 Bow Ome ese ll io eae 21 25 
35 261 9.828468 1.188 66 98 131 §.81824 8.866 29 43 
34 259 9.82825 1.886 67 98 118 $@.92868 1.886 19 12 
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36 TSS) 0°914728 12.0135 big I111 9 185580200805 262818 14 14 
SA 185 8.81446 $.924 47 65 CSemeOnO0S21meOe 7.56 11 11 
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44 117 $.98915 %.975 ay 55 SOmECEOU45 Ss CRS IoG 8 diz 
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4&3 191 6.98789 €$.969 27 45 61 6.88477 6.657 12 14 
44 81 $.82633 1.868 169 922) 59008 09505" 90.775 8 10 
45 192 $.88797 @.766 21 25 56 $.88438 $.562 13 21 
46 89 9§.88696 §.785 18 26 36 6.92281 9.755 1 1 
47 79 $.99618 9.841 24 34 38 6.82297 §.626 4 6 
48 67 §.88524 §%.857 17 a 41 §.6@%8328 $.561 vf vi 
49 52 9.98496 1.958 9 as | 34 6.809266 8.662 5 5 
59 54 $.8G422 8.892 az 25 28 $.82219 $.579 6 19 
Si 56 @.98438 ¢.798 13 ed, 28 $.88219 8.575 5) & 
sy 63 6.88492 $.68¢ 11 Re 22 $.600172 $.679 5 5 
55 42 §.99328 9.919 7 7 Aiiaee OO 0.155 ee Oe 50 2 2 
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aS 44 8.99344 9.716 16 16 26 8.808156 $.598 4 6 
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68 11 $.98986 9.479 1 1 2 $.98916 1.987 7) 1) 
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76 5 §.88939 9.384 1 1 4 $.096931 9.176 1 1 
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78 7 @.92855 6.218 3 5 G @. 9.119 7) 7) 
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_ No. of 3 yr. Weighted 95% Limits 
Category Brief Description Drivers Accs. Mean Upper Lower 


ni Minor neglect, licenses,permits 6495 2918 8.434 .445 9.422 
n2 Neglect, insurance, permits, etc. 1589 719 8.414 $.438 8.392 
n3 License suspended,HTA. 874 454 B.424 8.456 8.394 
n& Learners 34 18 8.343 Craver Or 2 lc 
v1 Minor veh.;lamps,noise 2954 1498 8.468 8.485 8.451 
v2 Brakes, tires,unsafe vehicle 946 451 B.489 6.438 8.371 
v3 Comm.veh;size & weights 583 369 8.542 %.583 9.508 
mi Seat belt 12337 4858 g.376 9.384 9.368 
m2 Speeding 173592 55211 §.319 O35295 07517 
m3 Coreless driving 982 342 Ceoe7 etews [ibs eae he) 
m4 Slow driving 45 11 9.119 9.237 8.855 
m8 STOP sign, ROW violations — - 14824 3935 g.288 9.295 @.281 
m9 PXO violations 1237 SD %.296 Dee pe One sc 
mig Turns violations;right,left,U 18231 4942 %.283 %.289 8.277 
mi1 Unsafe move;open door 1649 542 8.334 Pome yehey (brat op 
mi3 Disobey red light 13731 4278 8.313 $.321 8.396 
mi4 Amber light 3453 982 @.285 §.299 $.271 
m15 Advance green — 274 TEs %.265 Prol7) oe 218 
.mi6 Fail to share road 178 62 %. 383 8.372 8.242 
mi7 .Passing violations 1395 459 Ses2/ G2 5510975085 
mi8 Wrong-way one woy street 1582 458 2.284 %.396 8.264 
mi19 Improper driving divided h'way 2599 999 8.361 0.379 8.344 
me OeebeaaGoee 934 S577 8.344 6.374 $.316 
m21 Emerg. veh., school x'ing 48 15 8.159 9.288 9.984 
m22 R/R crossing violations 95 35 6.314 9.408 6.233 
m24 Headlight beam not lowered 220 hy 6.268 %.318 8.269 
m25 Improper parking 145 FT) 8.487 9.484 9.334 
m26 Foil stop for school bus 624 US: §.281 9251759. 249 
m28 Disobey traffic signs 1656 529 G8.322 9.344 §.381 
m29 Fail report accident , 224 Ta §.266 9.324 9.215 
m3@ Fail remain at scene 236 75 SAAS 9.373 9.261 
m32 Dangerous driving C.C.C. a) 2 9.019 $.421 6.900 
m33 Fail remain at accident C.C.C. 66 39 Geo 77, 9.491 6.274 
m34 Dangerous driving C.C.C. 89 46 8.281 0.376 8.262 
m35 Impaired driving C.C.C. 2381 1848 8.443 9.462 6.424 
m36 Fail/refuse breath test C.C:C. 94 37 9.218 9.386 8.149 
m37 Foil or ref. breath test C.C.C. 120 58 6.314 2.397 6.241 
m38 Driving with >88 mgs. alcohol 3676 1582 8.386 $.481 6.371 
m41 Crowding driver seat 1208 eis 2.280 $.362 9.211 
m44- Radar device in vehicle 68 27 @.251 2.359 6.166 
m45 No safe helmet, motorcycle 96 47 8.266 9.357 9.191 
m46 Fail to signal to stop 18 6 2.8808 2.277 6.819 
m47 FTC, commercial vehicle 67 35 @.362 9.475 6.262 
m48 Fail to stop for police officer ce 6 2.986 .348 9.816 
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Speed Above No. of 3 yr. Weighted 95% Limits 


Gimigte id Mice Dicd Vv CIcCmmACC Soe Mean Upper Lower 
All G-15. 123224 Sees %.293 9.296 9.291 
All 16-29 79589 22479 %.286 $.289 9.283 
All 38-49 Pood TOS 6.297 Prov 2eOre 9 
All >49 894 S71 hs NS) G$.344 9.286 
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56-7 g-15 84248 22037 $.294 Delo vm Or cos 
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28-46 16-29 5276 1413 $.281 G.293 9.27 
59-78 16-29 5999 See eh ea, G.293 $.297 $.288 
88-126 16-29 34318 SG59 %.281 BJ 


26-48 36-49 683 185 $.269 %.392 8.239 
59-78 39-49 9394 2914 $.395 $73 |4eOeeI6 
82-189 38-49 13984 4232 $.292 $.399 B.285 


28-49 >49 14 8 $.186 G.432 9.864 
59-79 >49 495 183 G.341 §.386 9.299 
82-109 >49 475 186 $.272 $.312 9.236 
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Number of Convictions in Each of the Final 14 Conviction Categories 


Appendix 2 


1981-1984 CONVICTIONS BY PROPOSED CONVICTION CODE CATEGORIES 
(Based on sample of 8888 drivers with record) 


TOTAL CONVICTIONS IN 


Offence Code 1981 1982 1983 1984 OLD 
Pts. ID Tot. Acc 
Total for all convictions 3586 3842 3617 3276 14269 976 


REVISED CATEGORY nni1 


Dr. m/v,no valid permit 10033 g ales 46 GSmen aml 2 eee 
fail not. ch. address 18859 4) 9 5 Sean) 19 5 
Fail rep.dam to h’wy prop. 12766 1 1 1 1 ni 4 5 
2 a8 H a Gee 12770 1 g 4) fala 2 2 
Fail to have insurance card 29959 g “4 23 34 ni 57 3 
M eC OGtty " Mg 20868 4) "4 5 qe =n 14 
. ey! SV Use Ofeins. 20278 g g 2 G ni 2 
e Peni tenoteeoddre ch. 98810 g 7 6 oS) gh 16 5 
No driver’s license 19198 26 54 26 22m come c Ome 0 
Deiv=.:cond. proh. 19193 gD 5 4 Seana 14 2 
Learner unaccompanied 9878D 1 M g G6 n4& 1 
Class R Drive m/c at night 99719 g 1 1 G6 n& 2 
a PeCont_ to@liczcond., §96799 1 DJ g G n4& 1 
REVISED CATEGORY nn2 
Permit un lic drive. 18195 1s) 1 1 G6 n2 7 
Nomi C= ahOlletonlLDeseut 18214 g 1 1 Ceen2 2 
More than 1 license 13278 g " 1 a 92 aa 
No insurance 22618 g Y) i no faye 3 
Op. m/v, no insurance 26911 g g 3 Seen 8 3 
False statem. re. insurance 2693¢ g g 2 G6 n2 2 
Alter/deface no. plate 18199 g g 1 G6 n2 1 
Allow/use altered plates 18118 4) Z 4) eer riz. 3 
Plates unlawf. removed 10128 g g 1 G n2 1 
Pos. canc/susp. license 18222 g 1 G6 n2 1 
Lend driver license 1G224 g "4 1 G6 n2 1 
Fail surr. dr. license 10228 gD 1 4) Damen 2 1 
Unlaw. pos. license 19320 1 1 g G n2 2 
Dr. m/v, no valid plates 18834 g gD z Veena 9 
No valid. tag on plates 18635 Y) 4) 5 Sheen 39 
Draw trailer, no permit 18037 4) g Le 2ouent 35 
faiissurcea m/v permit 10838 J gD > 1 Oame nd 18 
" st thle 18939 g ") g Cert 2 
No plate 1B06B "4 1 g G ni 1 
No plate, operate m/v 18663 4) 29 5 oe tid 55 
No val. tag on m/v 10667 J Te 2 G ni 15 
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Imp. plate pos. m/v 

Foil apply for permit 
DEQWettl eenOsDLoLe 

Imp. plates on ven: 
Use unauthor. plate val. 
Expose oth. no. near plate 
No. on plate not visible 
Folie tOsprod ma yGG@ae 1 cense 

n " al n " 
No name on comm. veh. 
Draw trailer, no plate 
Fail to corry/prod.permit 
Foilsret, pla convee Vener 
Dealer plate infraction 
EGR SA One lc ence 
Dre iC eEsusp eHel.A- 
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REVISED CATEGORY vv1 


No sl-mov-veh sign 


Improper troiler attachment 


Imp./insuff./ drvg. lamps 
Imp. lights, motor cycle 
Imp. head/drvg. lamps 
Cov. or coat lamps 

Imp. ident. lamps 

Imp. red front lamp 

imp. nO- a plLate stamp 

Imp. park lights 

Imps ream Lampwon trl: 


impeecre Lamp obj > Lequmw. 


imps. oOnGglwdevaces 
No mudguards 
View obstr., sign/poster 

rs 4 miOGeLCLesahung 
Obscure color window 
Windows obstructed 

" " 
Rear window obstructed 
Def/imp/no muffler 
Proh. use studded tires 


REVISED CATEGORY vv2 


Def./no brakes, m/v 
Ae Pe Diet GOLlen 


Def. brakes, contre sLosrege 


Def./imp tires 


16916 
18966 
16426 
18423 
1B44B 
16445 
16488 
19582 
18536 
16546 
10576 
18586 
19616 
18730 
19826 
16816 
18815 
1883 
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Use proh./unsafe veh 
Refuse exam. unsafe veh. 
Drive unsafe veh. 


Rm./mod./inop. s. belt ass. 


REVISED CATEGORY vv3 


Overweight dual-axle 

Gross veh overweight 
Overweight during freezeup 
Over weight excess of perm 
Insecure load 


REVISED CATEGORY vv4& 


OpeaVeni ete. displ.~ dev. 


Imp. war. eq., dis. comm./v 


Not mark overhang load 
Excess veh. width 
Excess len. veh-comb. 
Excess veh. height 


REVISED CATEGORY ee1 


Excess fumes/smoke 
Unnecess. noise © 

Horn/bell violation 
By-law unnec. noise 


REVISED CATEGORY mm1 


Fail yield ROW 

Disobey police signal 
Disobey stop sign 

Fail to stop, intersec. 
Fail to yield ROW at stop 
Eda tes top. oc intersec. 
Foil yield ROW at stop 
Fail yield ROW at yield 
Fol 1s tO. yield «ROW, Priard: 
Fail yld. ROW at inters. 
Fail yield ROW 
Imp. RT at intersection 
Imp. RT multi-l1n. h’way 
Improper left turn 

Imp. LT at intersection 
Improper left turn 

Proh. U-turn 

Proh. U-turn, hill crest 
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12426 
11258 
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Prohibited turn 
By-law, proh. U-turn 
U-turn unsafe 
Prohibited turn 
By-law, proh. Turn 
- amber " 
Disobey amber light 
Disob. green arrow 
FOlly proc moter iosn. soreen 
Wrong-way,one-way street 
PXO= Viol esame = side: oford: 
wr w Oo ite h s id] " " 


Pass at Ped. crossing 


FOL ls LOD ahOnesch  eDlsS mo .c 


n " w A] i] meet 


REVISED CATEGORY mm2 


Disobey red light 
Disob. flashing red 
Disobéy redelight 

Diss portable@red sight 
PoQad as TODO tat) Tec hOSse 
Crossing r/woy barrier 


REVISED CATEGORY mm3 


Unnec. slow driving 
Foil to stop for emer/veh. 
"ST ODECV=SCh Cran Ss LODeSI ON 


Fail lower h’light beam app. 


Foil to rep. accident 
Crowding driver seat 

Radar dev in m/v 

No safe helmet, motor cy. 
Foils ctor sivgnols.to. stop 
EGi tes COp st OGeOO Lemon cCer 
Crim neg. caus. death CCC 
Dang. driving CCC 

Donger = driving CCC 
Careless driving 


REVISED CATEGORY mm4& 


Unsafe move 
- "from parked pos. 
Imp. open., veh door 
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Fail to share road 
Fail share when overtaken 
a Hs be overtaking 
EGU Te COsLULnEOULECO.nty.Lt 
Improper passing 
imp7 poss. —, appr» trartic 
Imp. pass. - overt.traffic 
Driving left of center 
Dingle Ot CCN LCV ae Cross. 
Pass on right/off r’way 
Gen feet cheer, lente, 
Disob. troprice sign 
Disobey legal sign 
Fail st. sign or mk. int. 
Disobey sign re- tunnels 
By-law, disob. sign 


REVISED CATEGORY mm5 


Fail remain at scene accid. 
Foil/ref. breath test CCC 


Foil or ref. breath sam.CCC 


Om leremrat..occi a. 1CCC 
Driving with >8% mgs.alcoh. 
Impoiredsdriving C2C.C. 


REVISED CATEGORY mm6 


Drive on closed h’way 
Imp. dr. multi-ln. h’way 


Foil obey paved shld. signs 


imp. driving, div. h’ way 
iM weusSeaCcenaweln..5—10ne Ge. 


Disobey off.signs,div.h’way 


Imp. driving, div.h’way 

Dr. m/v on sidewalk 

Dre in des-.bus dane 

Enter proh. h’way/r’ dway 
bee Co 

F.T.C., motor vehicle 

Imp. parking 

Imp. parking 

Imp. park., Interf. w.traf 
Improper parking 

F.T.C., commer. m/v 

Dr. comm. veh. 1t. lane 
Fail/imp. use of Seat Belt 
Fail ens. passen. s. belt 


11556 
11939 
12185 
12120 
12138 
12148 
12156 
88216 
88812 
8H822 
12178 
12189 
12369 
12380 
12438 
99519 
12196 
88518 
11997 
11899 
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REVISED CATEGORY mm7 


Speeding -- km in --km zn. 11555 6 2269 Dene PL WALES 1949 m2 8557 4 
Speeding 11485 g J 1 a 2 
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APPENDIX 3 


Model Weights for Alternate Demerit Point Schemes 


Appendix 3 


Regression Run A1: Age, sex as dummies, no accidents 


Description of variable Estimated Standard 
Coefficient Error 

Intercept (Male, Age < 21) ®.1839 ®.090284 
Dummy variable for age 21-25 -@.03752 @.002832 
Dummy variable for age 26-30 -@.0564 ®.0202894 
Dummy variable for age 31-35 -0.2689 2.023025 
Dummy variable for age 36-42 -0.05246 2.093499 
Dummy variable for age 41-59 -0.86087 @. 803491 
Dummy variable for age 51-60 -%.95989 @.804776 
Dummy variable for age > 60 -@.06138 @.%86789 
Dummy variable for female -0.86661 @.801721 
MMicwicy  LOsyIeld wimp. turns, PXO, 

amber violations, etc ®@.02736 @.091646 
Mc DISODey Tred sights; 

rail crossing violations. ®.845535 @.0092791 
mms: Fail report; accid.; Careless driving; 

Dang. driving;Crim. neg. caus. death ®.048755 @. 804567 
mm4: Unsafe move;imp. o’taking; 

Disobey signs ®.859365 ®@.893723 
MMS woail sLOeremainse breatmstest ; 

alcohol; impairment. ®@.068515 2.015328 
mno-ssecatebeltca ht. Tce parking: 

divided h’way offences. 2.03421 ®@.201961 
mnni: Minor neglect: no drivers license; 

permits; insurance, address change CeO 53975 2.809674 
Mnece ENE Selects, D/ Lic msuspendeGuor nct 

produced; plates, insurance. @.03312 @.@03617 
vvil: Minor vehicle neglect: lamps 

windows obstructed, etc. %.051595 ®@.206479 
vv2: Vehicle neglect: Unsafe veh., 

brakes, tires @.08548 2.809013 
vv3: Truck weight offences. fe RoW Gaels 2.25898 
vv4: Truck dimension offences. Oe et og ®@.011842 
eel: Environmental offences: . 

noise, fumes. @.091265 2.988340 
mm7: Speeding offences ®. 82982 @.0808408 
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Regression Run A2: Age, sex as dummies, tot. 


accs. 


me en a a a a a a a a a a a a a a aaa srw cw we 


Estimated 
Coefficient 


Error 


Intercept (Male, Age < 21) 
Dummy variable for age 21-25 


Dummy 
Dummy 
Dummy 


variable for age 26-36 
variable for age 31-35 
variable for age 36-490 


Dummy variable for age 41-590 
Dummy variable for age 51-60 


Dummy 


variable for age > 60 


Dummy variable for female 


mmi: 


mm2: 


mm7: 


Fail to yield, imp. turns, PXO, 
amber violations, etc 

Disobey redwirgnts; 

rail crossing viiolavions: 

Fail nepOmtmaecCida.mcene | esSeumivings: 
Dang. driving;Crim. neg. caus. death 
Unsafe move;imp. o’taking; 

Disobey signs 

Fall to remain; breath test; 
aiconol; impairment. 

Seat DELty tate Ce per king, 

divided h’way offences. 

Minor neglect: nosdriversmacense; 
permits; insurance, address change 
Neglect: D/Lic. suspended or not 
produced; plates, insurance. 

Minor vehicle neglect: lamps 

windows ODStructed,. Ec. 

Vehicle neglect: Unsafe veh., 
Brakes,eovuines 


Insecure load. 


: Weight and dimension offences. 


Environmental offences: 
noise, fumes. 


Speeding offences 


Totaleaccidentssinepemlodm 


. 02696 


-@42125 


-@23550 


- 86359 


-2444 


-2444 


- 02624 


- 93566 


- 9353566 


- 08673 


De 


-11074 


- 08748 


. 026515 


qgaoaoaaa gauoa ad 


. 0902876 
- 082852 
. 0029135 
- 004044 
- 8693516 
- 003551 

- 994832 
. 087868 
~O01755 
. 081753 
. 082916 
. 008613 
- 904959 
- 054575 
. 054575 
0129463 
- 2045708 
. 9045708 
- 699296 
SO271 200 
TOA oe 
- ©98354 


. 080866 


. 091588 
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Regression Run A3: Age, sex as dummies, fault accs. 


ea aananandas 


- 082923 
.®@82905 
. 982968 
.983101 
. 203581 
.003576 
-@84922 
.@87197 
.0@01764 
.081937 
.083221 
.929526 
- 085488 
. 061209 
. 982356 
.0©11584 
-®@84899 
.087244 
-2@18293 
. 29826 
.013051 


. 289204 


- 088957 


Description of variable Estimated 
Coefficient 

Intercept (Male, Age < 21) 0.1811 
Dummy variable for age 21-25 -~@.93879 
Dummy variable for age 26-39 -@.85755 
Dummy variable for age 31-35 -@.86119 
Dummy variable for age 36-40 -2.05389 
Dummy variable for age 41-58 -@.86141 
Dummy variable for age 51-60 -@.85906 
Dummy variable for age > 60 -@. 06341 
Dummy variable for female -@.86346 
Onion COR yl el ai mp se turns, fF £0) 

enverevyLolations);, etc Q. 02833 
DNZeeDESODEY Tedmlignts, 

Gate CrOossineeva Ol atl Ons 2.044185 
Wo ate neponueaccia.+) Careless driving; 

Dang. driving;Crim. neg. caus. death GE VeaZ25> 
mm4: Unsafe move;imp. o’taking; 

Disobey signs @. 865445 
Moe hai) tOeremain- breath test; 

alcohol; impairment. @.2462 
MNO ssseate belt; bo.C.s parking: 

divided h’way offences. 2.035359 
nniss Minor neglect: mo drivers license; 

; permits; insurance, address change @.82664 

mn2: Neglect: D/Lic. suspended or not 

produced; plates, insurance. 2.03417 
vvil: Minor vehicle neglect: lamps 

windows obstructed, etc. 2.05188 
VVvecu. Vehicliesnéglect: Unsafe veh.; 

brakes, tires @.28958 
VVoceansecure load O2166275 
vv4: Weight & dimension offences. @.114335 
eel: Environmental offences: 

noise, fumes. ®@. 889395 
mm7: Speeding offences @.®27835 
Total at fault accidents in period 1 2.856635 


-®82495 


A3.3 


Appendix 3 


Regression Run A4: Age/sex as dummies, fault, not at fault accs. 


me ee me en a a a a a a a a a aa a a a a awe wr wr wrens ee 


Estimated 
Coefficient 


Error 


Intercept (Male, Age < 21) 

Dummy variable for age 21-25 
Dummy variable for age 26-39 
Dummy variable for age 31-35 
Dummy variable for age 36-48 


Dummy 


variable for age 41-58 


Dummy variable for age 51-60 


Dummy 
Dummy 
mmi: 
mm2: 
mm3: 
mm4: 


mm5: 


mm6: 


mm7 : 


variable for age > 68 

variable for female 

Ne wish Gelsy Sid Nwels SYehes Guibbesetsic, $794 0)r 
amber violations, etc 

Disobey red lights; 

rail crossing violations. 

Faily report accid:.; Careless driving; 
Dang-) drivings Crimea neg.) causmeceaun 
Unsafe move;imp. o’taking; 

Disobey signs 

Fail» to remain; breath vest; 
alcohol; impairment. 

SCavDel ius ae Cee penK tie. 
divided h’way offences. 
Minorerneglect nosorivers sii cense, 
permits; insurance, address change 
Neglect: D/Lic. suspended or not 
prodquced; places, einsurace. 
Minor vehicle neglect: lamps 
windows obstructed, etc. 
Veniclesneglect-mUnSauemV Gling 
brakes, tires 


Insecure load 


: Weight & dimension offences. 


Environmental offences: 
noise, fumes. 


Speeding offences 


Totalrvat faults accidentssinsper oda 


Total driving properly accidents, period 1 


. 082876 
-®@82852 
- 902915 
- 083044 
. 003516 
-@8551 

- 204832 
- 207068 
- 001735 


geo ao aaa a8 


2.201980 
@.0093158 
®@.2989338 
@.@853572 
2.068012 
0.002310 
0.011554 
@.084018 
2.207183 
2.010091 
®.929246 
2.012793 
2.289020 
2.880939 
@.902446 


2.082484 
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Regression Run Bi: No Age/sex variables, 


no accs. 


= aa a a a a a a a a a a a a a a a a a a a ae ae a ee 


Estimated 
Coefficient 


a a a a a a a a a a a a a a a ee ee ee ee ee ee 


Intercept 

MMe eel evo yl CLO et MP em curs y EEA OG 
amber violations, etc 

mme. Disobey redviients; 
Gaiiecrossingeviolations. 

Mmseemaitereporteaccid,., Careless driving; 
Dang-pdriving;Crim. neg. caus. death 

mm4: Unsafe move;imp. o’taking; 
Disobey signs 

mm5: Fail to remain; breath test; 
alconouss impairment. 

moO wocate Del tee rT Cs) parking: 
divided h’way offences. 

nis) Manor neglect: novdrivers license; 
permits; insurance, address change 

Dna Ne elect )/ Lic. suspended or not 
produced; plates, insurance. 

vvi: Minor vehicle neglect: lamps 
windows obstructed, etc. 

Vile gmeVvensec leaner Lect maunsate, ven. , 
bmakesh atares 

vv3: Insecure load 

vv4: Weight & dimension offences. 

eel: Environmental offences: 
noise, fumes. 

mn7: Speeding offences 


Q.85224 


@.85341 


2.06617 


@.86722 


2.879245 


@.0435205 


2.943055 


2.84907 


2.06633 


0.18202 


2.2004 


2.12488 


0.1207 


@.03416 


2.881486 


2.201790 
@.283035 


@.004958 


2.984054 


®@.816679 


2.982128 


2.810544 


@. 905926 


0.807845 


@.909826 


@.827991 


Q.912789 


2.009048 


2.980907 
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Regression Run B2: No Age/sex variables, total accs. 


Description of variable Estimated Standard 
Coefficient Error 
Intercept ES Me yeys) - 8.881520 


Dns bar letOsylel ca, eMpemcuUnnc mer AO. 


amber violations, etc ®@.835148 %.821886 
mm2: Disobey red lights; 

rail crossingyviroletions. @.84925 2.003138 
mn3>= Fail report) accid =); Careless sdrivineg, 

Dang. driving;Crim. neg. caus. death @.841585 @.809278 
mm4: Unsafe move;imp. o’taking; 

Disobey signs @.872585 @.985341 
mm5<) Kal letoeremaine Dreatlo Tesi: 

alcohol; impairment. G2 o> %.95983@ 
mmO<) SeatebDelt jh) Cee parking: 

divided h’way offences. @.8@3992 2.802298 
mniceMinor neglect. snocdriviers aicense, 

permits; insurance, address change @.8355915 2.811297 
nn2: Neglect: D/lic] suspended orenet 

produced; plates, insurance. @.84807 @.203981 
vvile Minor vehicle neglect; lamps 

windows obstructed, etc. @.062545 @.82796@ 
vv2e: Vehicle neglect: Unsafe veh-, 

brakes, tires 0.101415 2.810035 
VV5o2) Insecure load @.169635 @.829873 
vv4: Weight & dimension offences. Oot i1 S52 @.012712 
eel: Environmental offences: 

noise, fumes. @.114985 2.808946 
mm7: Speeding offences ®@.03063 @.008927 
Total accidentsein periods ®. 965045 @.921696 
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Regression Run B3: No Age/sex variables, fault accs. 


@.081520 


@.001893 


@.003515@8 


@.009317 


2.205363 


@.859893 


2.882298 


2.011343 


2.885997 


%.807088 


2.018074 


@.829239 


@.012786 


@.828981 


®@.@80929 


Description of variable Estimated 
Coefficient 

Intercept @.18695 
Mice ae VOR y elo, imp. | curns, PxO; 

amber violations, etc @.05314 
Miz<Disobey red Lignts; 

rail crossing violations. @.251685 
Moe ratlerenor veacciG., Careless? driving; 

Dang. driving;Crim. neg. caus. death @.04427 
mm4: Unsafe move;imp. o’taking; 

Disobey signs @.874925 
MMos alhetOo remain; sbreath test; 

alcohol; impairment. @.2662 
muon seat. belt; sk. T.Csssparking; 

divided h’way offences. 2.04174 
mnie Minor neglect: no drivers license; 

permits; insurance, address change O27 0567.55 
Hue Neglect: .)/ Lic. suspenced or not 

produced; plates,. insurance. 0.04918 
vvi: Minor vehicle neglect: lamps 

windows obstructed, etc. 0.06561 
VV ComVenUCLeaneg lect: Unsaresveh., 

brakes’, tires 2.18512 
VVo- einsecure load 2.178885 
vv4: Weight & dimension offences. he eats as) 
eel: Environmental offences: 

noise, fumes. @.118 
mn7: Speeding offences @.03226 
Total at fault accidents in Pen ode %@.265087 


G.0024335 
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Regression Run B4: No Age/sex var., fau.,n/f. accs. 


me es me as es ew we we we ee a a ea ae a a a a a a aa ewes nse srw csr ews ee 


Description of variable Estimated 
Coefficient 
Intercept @.18555 


mms Faille to yield, =1mpemeturns,e AU, 


amber violations, etc @.8@31495 
mm2: Disobey red lights; 

rallisecrossingeviolaccouse 2.849245 
mms: Babl reporteaccids; Canelessscriving, 

Dang. driving;Crim. neg. caus. death 0.8417 
mm4: Unsafe move;imp. o’taking; 

Disobey signs @.97265 
mm5: Fail to remain; breath test; 

alcohol; impairment. QRZ DDS 
mmos Seat. Delt; "hal Gy, sparking. 

divided h’way offences. @.@59955 
nni: Minor neglect: no drivers license; 

permits; insurance, address change Q@.8@36@82 
nn2: Neglect: D/Dice suspended or nov 

produced; plates, insurance. 2.048135 
vvi: Minor vehicle neglect: lamps 

windows obstructed, etc. @.86255 
vv2: Vehicle neglect: Unsafe veh., 

Drakes. canes 2.101415 
vv3: Insecure load @.16913 
vv4: Weight & dimension offences. @.11844 
eel: Environmental offences: 

noise, fumes. an ed todo) 
mm7: Speeding offences @.030625 
Totalmwat faulteaccidents Aneperiodun @.26248 
Total driving properly accidents, period 1 @.86775 


A3.8 


Error 


0.001494 


0.001855 
0.903085 
9.009119 
@.805251 

0.058638 
0.902251 

0.011103 
0.003913 
0.006942 
0.009862 
0.028628 
0.012517 
0.008794 
0.000911 

3.002384 


@.002427 
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Regression Run C1: Age,sex; total cons., no accs. 


Description of variable Estimated Standard 
Coefficient Error 

Intercept (Male, Age < 21) @.1839 0.80284 

Dummy variable for age 21-25 -@.@3752 @.802832 
Dummy variable for age 26-39 -0.8564 ®@.8@92894 
Dummy variable for age 31-35 _-%. 0689 @.883025 
Dummy variable for age 36-490 -2.05246 @.8@835499 
Dummy variable for age 41-52 -2.%6007 @.8@083491 
Dummy variable for age 51-60 -8.25989 @.004776 
Dummy variable for age > 6@ -2.86138 @.806789 
Dummy variable for female -0.@6661 @.001721 
Total convictions, period: 1 2.03212 %@.009812 
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Regression Run C2: Age,sex; total cons., total accs. 


ee ee a a ea a a a er a a a ee a a a aaa 


Description of variable Estimated Standard 
Coefficient Error 

Intercept (Male, Age < 21) Or1165 @.902876 
Dummy variable for age 21-25 -@.03856 @.8902852 
Dummy variable for age 26-30 -@.0565 2.002913 
Dummy variable for age 31-55 ~2.86006 Q@.083044 
Dummy variable for age 36-40 -0.95202 @.003516 
Dummy variable for age 41-59 -0.06013 2.00351 

Dummy variable for age 51-68 -~0.@5744 2.204832 
Dummy variable for age > 60 -~%@.06156 2.807068 
Dummy variable for female -@.86122 2.801735 
Potad sGconvictions, period) 0.82951 2.880827 
TotalsAcciadents, periods! @.95962 2.801674 
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Regression Run C3: Age,sex; total cons., fault accs. 


Description of variable Estimated Standard 
Coefficient Error 
Intercept (Male, Age < 21) %.1811 ®@.002923 
Dummy variable for age 21-25 -%.%93879 ®@.882985 
Dummy variable for age 26-30 =0.85755 @.002968 
Dummy variable for age 31-35 -@.06119 2.903181 
Dummy variable for age 36-40 -%.05309 2.903581 
Dummy variable for age 41-590 -@.06141 ®@.@083576 
Dummy variable for age 51-60 -@.05986 2.984922 
Dummy variable for age > 60 -%.96341 2.087197 
Dummy variable for female -%9.06346 @.001764 
Totawsconvictions, period a ®@.93182 2.820842 
Atebauit Accidents, peniod | ®@.05743 @.002434 


Appendix 3 


Regression Run C4: Age,sex; tot. cons., fio,n/fewaccs. 


Description of variable Estimated Standard 
Coefficient Error 

Intercept (Male, Age < 21) Q@.1763 @.802876 
Dummy variable for age 21-25 -@.83856 2.882852 
Dummy variable for age 26-30 -%@.0565 2.002913 
Dummy variable for age 431-35 ~@. 96986 2.983044 
Dummy variable for age 36-4@ -@.05202 @.003516 
Dummy variable for age 41-50 -%.86015 %.00351 

Dummy variable for age 51-60 -0.95744 @.894832 
Dummy variable for age > 60 -@.86156 @.007968 
Dummy variable for female -@.86122 2.881735 
Total convictions, period” 0.82951 2.800827 
AWeraunDeACCIGeNtS.mpenoden 2.05526 2.802388 
Driving properly accidents, period | @.@6289 @.002426 
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Regression Run Di: No age,sex; tot. cons., no accs. 


Description of variable Estimated Standard 
Coefficient Error 

Intercept 0.1873 2.001424 

Total convictions’, period 1 2.93891 @.880773 


A3.13 


Appendix 3 


Regression Run D2: No age,sex; tot. cons., tot. accs. 


a a we a a aa a a a a a aw awe asec ase ee 


Description of variable Estimated Standard 
Coefficient Error 

a | eer | Goto 

TOtamecOnVvicul ONS pe glo” m C2055 18 2.880785 

PObaAleACCIGents «apeT OOM! @.06618 @.201604 
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Regression Run D3: No age,sex; tot. cons., fau. accs. 


Description of variable Estimated Standard 
Coefficient Error 

oe ee ee Gy gss0 es) eoScrasaal | 

Total convictions, period 1 @.03705 %.2828798 

At fault Accidents, period 1 ®.06644 0.002338 
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Regression Run D4: No age,sex; tot.cons., f-n/f.eaccs= 


Sg gyn cag ee me Teac aR meee ae a a ec a a a a ee a ce en a ee 


Description of variable Estimated Standard 
Coefficient Error 

ee nna nee ys 

Total convictions, period 1 2.03518 2.808785 

At fault Accidents, period 1 %.86378 @.802291 

Driving properly accidents, period 1 0.86862 0.802334 
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Theoretical Framework 


Theorectical Framework 
Each driver has an expected number of accidents for 1983-1984. For driver i 
this will be denoted by m.. Consider a reference population of drivers 
identical in gender, age-group, and also in their 1981-1982 count of convictions 
and accidents. The average of their m’s is E{m}. Since not all the m’s in 
the reference population are the same, there is also a variance of m’s, 
VAR{m}. 
We estimate E{m} by 
E{m} =a, + a, + a, + b,x, + box, + ... (1) 
where 


a, is a parameter for the "Base Driver" (who is a male in the 17-21 age 


group). 
ais a parameter, g = 2,3,...8, depending on the age group as shown below 
2 21-25 years old in 1982 
3 26-30 t aks ! "" 
4 31-35 tt I] 1 9 
5 36-40 't t t I] 
6 41-50 "! ‘Al A] ¢ 
7 51-60 tt " A "9 
8 > 60 't t t iA] 
a. is a parameter for "female" and by, b> =. are the “weights or 
"Darameters" for variables x,, x, etc., which are the numbers of convictions 
or accidents of a given type during 1981-1982. 
We estimate the variance of the m’s by 

VAR{m} = [E{m}]*/k | (2) 

where k is a parameter. 
A4.2 Estimation of Parameters 
‘We have estimated the parameters of equation (1) by a software well suited 
for accident counts. It facilitates the appropriate representation of accident 
counts by the negative binomial distribution and yields maximum _ likelihood 
estimates for the parameters. (Baker, R.J. and Nelder, J.A., The GLIM 


system. Generalized Linear Interactive Modelling, 1978.) 


The parameter k in equation (2) has also been estimated. by maximum 
likelihood methods. A special program had to be written for this purpose. 
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A4.3_ Estimation of E{m} and VAR{m} from Accident Counts 


It can be shown that if for every driver the accident occurrence obeys the 
Poisson Probability Law, then, for a group of drivers, 


E{m} = x = sample mean of accident counts 


VAR{m} = Saweex where s~ is the sample variance of accident counts. 


A4.4 The "Gamma Assumption" 


Much of the analysis in section 4 rests on the assumption that the distribution 
of m’s (accident potential) in the population of the Ontario drivers can be 
represented by a Gamma probability distribution function. As noted in the 
main text in section 4.4 we cannot be sure this assumption is a valid one until 
we check whether the accident counts which are observed support. the 
"Gamma Assumption". | 


In tables A4.1 to A4.9 we check whether the number of drivers observed to 
have 0,1,2,... accidents can be well reproduced if one uses only x, s', and the 
Poisson and Gamma assumptions. Thus, for example, there were 6923 
males, 17-20 years old, who during 1981-1984 had 1 accident (see Table 
A4.1). During the same period of time for males who were 17-20 years old, 
the sample mean was 0.463 accidents and the sample standard deviation was 
0.718 accidents. Using these two Sample statistics, we should expect 6867 
drivers to have 1 accident if the "Gamma Assumption" was correct (Table 
A4.1). What has been observed and what is expected if the Gamma 
Assumption is correct seems to correspond. In fact, all pairs of numbers in 
all tables display good correspondence. 


In tables A4.1 - A4.9 the Gamma assumption was tested "weakly". We used 
two numbers (x,s”) to generate 5 to 9 other numbers and showed that these 
matched well what has been observed. We now proceed with a stronger test. 


In tables A4.10 to A4.23 we examine 14 two period tables. Thus, for 
example, in Table A4.10 we show that 3214 male drivers, 17-20 years of age, 
had 0 agcidents in 1981-1982 and 1 accident during 1983-1984. Now, using 
only x,s° for 1981-1982 and x for 1983-1984, we use the Gamma assumption 
to estimate what would be the expected number of such drivers - S252 63 
Once again, the correspondence in all 15 tables is impressive. 


We conclude that. the information we have about accident counts for Ontario 


drivers does not contradict the appropriateness of making use of the Gamma 
Assumption. — 
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ABLE A&.1: ACCIDENT DISTRIBUTION FOR 17-28 YEAR OLDS 


No. of accs Drivers with x accs. Drivers with x accs. 
1981-84 (x) Observed Estimated Observed Estimated 

i) 16597 ODO, 16966 16977 

1 6923 6867 3892 3969 

2 heey 1782 S39 369 

3 343 370 45 37 

4 TAS) 67 5 3 

5 135 uy 

6 g 2 

All 25636 20447 


No. of accs Draversewithex acces. Drivers with x occs. 
1981-84 (x) Observed Estimated Observed Estimated 

4) 45854 45224 59331 58359 

1 nad o2 14778 7 sy2e) Thsisw/ 

2 3283 3441 855 8893 

5 662 693 84 95 

4 154 129 As 16 

5 28 25 iy 1 

6 3) 4 

fh 4 1 

All 64292 58911 
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NOm@Oneaces Drivers with x accs. Drivers with x accs. 
1981-84 (x) Observed Estimated Observed Estimated 

g 49857 50816 57888 57962 

1 12961 12607 6968 6946 

2 2528 2728 734 726 

5 493 S44 52 72 

4 128 196 Z, Ye 

5 2 29 4 1 

6 2 4 4) g 

7 1 1 1 i) 

8 2 g 

9 1 g 

All 65398 65654 


NOMEOMmOCCS Drivers with x accs. Drivers with x accs. 
1981-84 (x) Observed Estimated Observed Estimated 

g 39864 39993 58866 52891 

1 19985 9738 : Caves 6198 

2 1861 1952 589 648 

S) 291 S565 78 66 

4s 74 65 9 v/ 

2 16 11 4) 1 

6 tf, 2 1 g 

7, 4 4) 

8 2 7) 

All 52124 57726 
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No. of accs Drivers with x accs. Drivers with x accs. 
1981-84 (x) Observed Estimated Observed Estimated 

g 20446 | 28582 29255 29254 

1 4884 4744 3682 3689 

2 927 1069 416 4G2 

> 192 284 32 43 

4 43 41 7 5 

5 9 8 1 1 

6 1 2 

7. 5 g 

All 26581 S555) 1155 


NOs oOneaccs Drivers with x oaccs. Drivers with x accs. 
1981-84 (x) Observed Estimated Observed Estimated 

4) 24587 24665 31884 31894 

1 5429 5256 346 3435 

2 972 1941 336 348 

& 186 291 aes 34 

4 28 38 8 2 

= 19 vs 1 g 

6 5 1 

vs 5 4) 

8 1 g 

AL 31216 34916 
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a a ee ee = ww es = ee ee ee 


No. of accs Drivers with x accs. Drivers with x accs. 
1981-84 (x) Observed Estimated Observed Estimated 

4) 12119 12113 14873 14864 

1 2361 2575 1387 1419 

2 413 496 176 Wz 

3 68 66 16 17 

4 19 19 g fe 

5 1 2 

All 14972 16446 


No. of accs Drivers with x accs. Drivers with x accs. 
1981-84 (x) Observed Estimated Observed Estimated 

QD . 6456 6469 WAZ Wi 

1 1199 Yuys 788 792 

2 197 Zale 66 63 

3 41 39 4 4 

4 5 i 

5 4 1 

All 7962 797@ 
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NO-sOtmoccs Drivers with x accs. Drivers with x accs. 
1981-84 (x) Observed Estimated Observed Estimated 

g 214884 215424 2585/5 258458 

1 58935 57736 33997 32924 

2 11874 12441 3585 vove 

3 2276 2463 338 370 

4 5689 466 47 Oi, 

5 184 86 a 4 

6 28 16 1 gD 

Ps 19 3 1 g 

8 5 1 

9 1 ) 

All 288635 Pac he yet F/T) 
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TABLE A&.18: 2-PERIOD ACCIDENT FREQUENCIES FOR MALES AGED 17-28 YEARS 


(Period 1: Mean=%.2461, Var(x)=%.2605; Period 2: Mean=%.2173) 

Per 

1 Observed and estimated no. of drivers with period 2 accident counts = 
Accs g 1 2 3 4 5 

9 1699751651958 e521 Ses os5on5 422 391.6 SY Shs Ff = Sime G §.2 
1 3789 3669.5 848 886.9 T22meel eo at ue) 14.3 3 1.4 G6 §.1 
2 So 592.2 Uae Up Faye. 9 il 24.3 6 Sul Y) G.3 G 9.9 
5 99 54.8 19 18.4 2 eh 8) D 0.5 ) (ae, 4 G 9.B 
4 5 Smee 1 2.0 g .4 g 6.1 g 5 G6 OB. 
5 1 G.5 a) G.2 g G.1 g 0.9 ) G.B G 2. 


TABLE A4&.11: 2-PERIOD ACCIDENT FREQUENCIES FOR FEMALES AGED 17-28 YEARS 


(Period 1: Mean=$.1%57, Var(x)=8.1885; Period 2: Mean=@.%886 ) 

Per 

1 Observed and estimated no. of drivers with period 2 accident counts = 
Accs ) 1 2 3 4 = 

§ 16966 16951.9 1382 1394.9 70 FA9 3 amo Y) %.1 G 9. 
1 1718 1788.9 ee ASS 4 es 18.9 1 g.5 7) G.D G G.B 
2 96 187.9 20 13.4 3 1.0 B g.1 1) B.D G 9. 
3 i; oF 1 Wists ) G.1 Dg B.D ") ad SU (A 
4 D Sr2 g .D g B.D " Z.D g B.D G G.D 
2 4) G.D M4 B.D DB B.D g B.D ) G.B G G.D 
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—_—e——— oro a a a ww wn a wn ae we we we we we we we i we we ae ee ae ee ee ee ee ee ee ee eS 


TABLE A&.12: 2-PERIOD ACCIDENT FREQUENCIES FOR MALES AGED 21-25 YEARS 


(Period 1: Mean=%.2849, Var(x)=6.2231; Period 2: Mean=%.1746) 
Per 
1 Observed and estimated no. of drivers with period 2 accident counts = 
Accs. Ses 1 2 3 4 5 
G 45954 45249.8 6917 6781.8 TRY SU pre) 5 74 68.1 14 Seg Uo Ness) 
1 aVelete:  Uftelstoyh abet ieS. Pra 246me2597/ 39 276 2 265 Wek 
2 968 1964.1 22 tee Simo 49 48.6 6 6.6 3 9.8 6 6.1 
3 128 118.6 S7 38.9 8 726 "4 Uo g G.2 G 8.o 
4 15 ete 8 4.6 1 eed 1 G.2 4) Z.9 G6 9.9 
5 3 1.1 4) Dab 1 G.1 g G.d 9g 9.8 G 9.9 


TABLE A4&.13: 2-PERIOD ACCIDENT FREQUENCIES FOR FEMALES AGED 21-25 YEARS 


(Period 1: Mean=%.%8861, Var(x)=6.9991; Period 2: Mean=%.%779) 
Per 
1 Observed and estimated no. of drivers with period 2 accident counts = 
Accs ") 1 2 3 4 5 
Bee 551859557285 °5592) 5606.2 219 198.9 Se vec 1 O25 1° 829 
1 4931 3988.1 416 437.8 24 32.4 1 2.0 4 G.1 G G9. 
2 226 242.1 39 Soc 6 55> 1 %.2 "7 0.g G 9.9 
3 16 dome 2) 7h ) .3 g G.B B Z.D G 9. 
a 'g Die 7. 1 G.2 g G.D D B.D 4) Z.D G G. 
5 4) G.G 7) B.D g G.D g B.D g B.D G 8. 
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TABLE A&.14&: 2-PERIOD ACCIDENT FREQUENCIES FOR MALES AGED 26-38 YEARS 


(Period 1: Mean=%.1698, Var(x)=%.1888; Period 2: Mean=@.1452) 

Per 

1 Observed and estimated no. of drivers with period 2 accident counts = 
Accs: 4) 1 2 %) 4 5 

G 49857 59994.8 6121 5926.5 G159 767 1720 aye 59.9 11 si 7/ G§ 9.5 
1 GSE050 6562-5 sie oom ooo 165 199.9 28 Pass 2) 4 3.9 ime oo 
2 708 749.3 263 2208.4 39 42.1 18 6.6 3 g.9 ip Wiha 
5 Ue) Siz foi Sha 3 16%) 1 1.4 "4 G.2 G6 OB. 
4 16 8.6 4 4.9 7 bet 7) G.2 M G.g G6 G.G 
5 2 G.9 9 g.5 1 G.2 1 G.D 1 G.D G 9.9 


TABLE A&.15: 2-PERIOD ACCIDENT FREQUENCIES FOR FEMALES AGED 26-398 YEARS 


(Period 1: Mean=6.8658, Var(x)=%.9692; Period 2: Mean=.#659) 
Per 
1 Observed and estimated no. of drivers with period 2 accident counts = 
Accs ") 1 2 fe) 4 5 
Oe I5 7883. 579220.2 0 OF O5 moe OUn2 215 2185-9 8 ono 1 G.5 G G. 
1 3595 3449.8 348 368.8 aS 28.4 2 1.9 1 G.1 G6 9.8 
2 181 ioone Fae > Piel) ") 29 2 8.2 g G.G G$ 8.8 
3) 8 9.4 3 EIS 1 G.2 Y) B.D M4 G.g 6 8.9 
4 1 G.5 ) .1 "4 G.B 1 B.D g 0.8 G$ G.B 
5 g %.G ) B.D g %.D M7) G.g g G.D G$ 9.6 
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TABLE A&.16: 2-PERIOD ACCIDENT FREQUENCIES FOR MALES AGED 31-35 YEARS 


(Period 1: Mean=8.1495, Var(x)=9.1635; Period 2: Mean=%.1394) 

Per 
1 Observed and estimated no. of drivers with period 2 accident counts = 
Accs g 1 a 3 4 is 
G@ 39864 39963.9 4893 4719.6 443 452.8 35 46.1 2 ea! eS 
1 5282 5962.6 934 971.4 188 129.6 21 14.6 3 125 G6 9.1 
2 484 521. 114 138.4 24 Zoo 4 ie. 3 G.4 G 9. 
3 42 49.5 19 16.8 5 eA) 3 8.6 4) 9.1 8.0 
4 8 4.5 2 19 1 G.5 1 g.1 1 G.B G6 9. 
5 1 G.4 4) %.2 2 G.1 4) 8. gD 0.2 6 9.9 

TABLE A&.17: 2-PERIOD ACCIDENT FREQUENCIES FOR FEMALES AGED 31-35 YEARS 

(Period 1: Mean=$.8635, Var(x)=%.9673: Period 2: Mean=§.%687) 

Per 
1 Observed and estimated no. of drivers with period 2 accident counts = 
Aces: g 1 2 3 4 S| 
Wee 0500) 5095045595216 511861 166m tose fF 19.6 MY) 0.6 G 9.B 
it 2961 2886.¢ 2045 2 559R5 25 rae) de 5 cu g %.2 G $G$. 
2 142 LOZ 31 2e35 = Se g G.3 4) G.D G G.9 
3 va 8.4 2 Ue g 8.3 1 B.D D 0.9 G GB. 
4 1 B.4 M) G.1 g B.D g 0. "4 B.D G GB. 
Ss g 2.D Dg 9.8 M4 G.B 4) G.D " 0. G 9.8 


A4.10 


Appendix 4 


TABLE A&.18: 2-PERIOD ACCIDENT FREQUENCIES FOR MALES AGED 36-46 YEARS 


(Period 1: Mean=$.1476, Var(x)=8.1641; Period 2: Mean=§.1381) 

Per 

1 Observed and estimated no. of drivers with period 2 accident counts = 
Accs g 1 2 3 4 5 

GP 26448 29468.3 2339 2323.8 22) gee 17 22.9 2 2. G §.2 
1 2545 2483.8 487 495.4 64 7 OR, 9 S/. 4) 1.9 ee eg 
2 PANS 264.7 77 VAa.)s) 28 14.6 1 2 4) %.3 G G.G9 
= 34 Zong 12 18.8 6 2m5 1 G.4 3 0.1 G G.B 
4 g Les, 2 Use B DES 1 B.1 g 2.¢ G G8. 
5 4) 5 5 4 | 1 ) B G.D g G.D G6 9. 


TABLE A&.19: 2-PERIOD ACCIDENT FREQUENCIES FOR FEMALES AGED 36-48 YEARS 


(Period 1: Mean=$.9699, Var(x)=8.9732. Period 2: Mean=%.%671) 
Per 
1 Observed and estimated no. of drivers with period 2 accident counts = 
Accss g 1 2 3 4 = 
Gee292555 2922455 757 7 9oeo 10 325 2 4.5 1 Z.2 G GO. 
1 1845 1868.4 218 192.8 u 14.6 3 G.9 g G.1 G- G6. 
2 938 188.5 18 14.6 2 1.4 1 6.1 g 2.D G6 9.6 
5 5 Sal 1 %.9 GB hed ) 0.D gD 0 .D G 9. 
4 g G.2 g G.1 4) B.D 4) B.D 4) 2.2 G G.6 
5 g D.D i) 0.9 g B.D 4) B.D g %.D $ GG. 
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TABLE A&.28: 2-PERIOD ACCIDENT FREQUENCIES FOR MALES AGED 41-58 YEARS 


(Period 1: Mean=%$.1324, Var(x)=6.1565; Period 2: Mean=%.1285) 
Per 
1 Observed and estimated no. of drivers with period 2 accident counts = 
Accs 4) 1 fd %! 4 5 
9 24587 24773.9 2688 2587.1 pets ketosis | 26 20m 4 ete’ G G$.3 
1 2/49) 2582.7 7, Ome Ole. S 66 S225 7 11.4 3 Eka) a 
2 262 Fahey ata) 70 85.8 u/ 1726 1 eee D G25 he A hese 
3 24 Bore 9 21 4 Sel Gg G.7 1 6.1 G G.G 
4 1 Sul 1 126 6B $.5 Sere Oe g B.D G G.G 
5 1 %.3 2 %.2 o 6.1 DB G.D Gg G.D G G8. 


TABLE A&.21: 2-PERIOD ACCIDENT FREQUENCIES FOR FEMALES AGED 41-58 YEARS 


(Period 1:. Mean=6.@624, Var(x)=8.8664; Period 2: Mean=%.%593) 
Per 
1 Observed and estimated no. of drivers with period 2 accident counts = 
Accs. M4) 1 2 %) 4 5 
$5 1084.51156.5- 1677 1639.9 97 88.3 1) 4.8 1 %.3 G 8. 
1 tae p= are pe! ierer  ahshe AS) 10 Ou 2 TT ") G.1 G G.G 
2 84 SiS 18 152.9 2 UW 1) 0.2 ") B.D G G.G 
3 7 DE6 2 lee ) J.2 M7) B.D DB .D G 9. 
4 1 6.3 ) 9.1 g 2.2 Y) B.D 7) G.B 9. 
5 1 B.2 iy .D g .D ) G.D 1) G.D G 9. 
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TABLE A4&.22: 2-PERIOD ACCIDENT FREQUENCIES FOR MALES AGED 51-68 YEARS 


(Period 1: Mean=$.1196, Var(x)=9.1264; Period 2: Mean=9.1%99) 
Per 
1 Observed and estimated no. of drivers with period 2 accident counts = 
Accs. B 1 2 3 4 3) 
9 12119 1294169 911988119385 181 S/z5 6 Seo "4 %.3 G 9. 
1 A ON 2 fee, 206 189.9 26 18.4 vy Gis: My) 0.1 $ G8. 
2 196 193.3 32 20.8 2 2.4 ) G.2 ") D.B GS 9.8 
3 4 Jee 1 TER7. 1 g.3 g B.D Y) 0.8 G 9.6 
4 M4 G.5 7) 9.1 g g.B g B.D 7) G.6 G G. 
5 "y 0.9 GB B.D g B.D g %.B 4) D.D G 8.9 


TABLE A4&.23: 2-PERIOD ACCIDENT FREQUENCIES FOR FEMALES AGED 51-68 YEARS 


(Period 1: Mean=8.%557, Var(x)=9..9538: Period 2: Mean=%.%522) 

Per 

1 Observed and estimated no. of drivers with periods 2saccidentscounts== 
Accs. 4) 1 2 3 4 2) 

$ 14873 14867. 672 686.2 42 36.6 1 220 DB G.1 G G. 
1 Tile 725m 76 Tas) Hf 12 6.4 Bg 0.5 B.D G 9. 
2 32 48.9 4 6.8 M4 %.8 g G.1 J B.D G 8.9 
3 1 2.4 B G.5 "4 O.1 g g.D B B.D G 9. 
4 Dg | DB B.D 4) B.D Dg G.D ) G.G G G.9 
5 4) %.D Gg G.G ") B.D J 0.D "4 0.2 G 8. 
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TABLE A&.24: 2-PERIOD ACCIDENT FREQUENCIES FOR MALES AGED > 68 YEARS 


(Period 1: Mean=%.1212, Var(x)=8.1392; Period 2: Mean=%.1896) 

Per 
1 Observed and estimated no. of drivers with period 2 accident counts = 
Accs g 1 2 3 4 s) 
") 6456 6499.4 SOS me ioe 35 45.1 2 4.1 g B.4 G OG. 
1 641 618.6 99 188.8 16 14.8 1 ite) ) G.2 G 9.6 
2 63 65.6 Ad, UES MY) Seo g G.5 g 0.1 G6 8.9 
3 6 Tie. 3) 27 2 G.6 4) g.1 ) B.D G 2. 
4 1 g.8 2 %.4 GB 6.1 g B.D g B.D G6 B.D 
2 D G.1 g 0.1 ") 0.9 B B.D ) B.D G6 G.9 

TABLE A&.25: 2-PERIOD ACCIDENT FREQUENCIES FOR FEMALES AGED > 68 YEARS 

(Period 1: Mean=@.%572, Var(x)=%.9587; Period 2: Mean=%.%597) 

Per 
1 Observed and estimated no. of drivers with period 2 accident counts = 
Accs. "4 eel 2 3 4 5 
7) Taye FAG Nts} 496 493.2 ite Lomo Gg G.6 g 0.2 G 9.8 
1 382 SOOM Sh) ribet) 3 lew, g g.1 g B.D G 9. 
2 18 tS 1 eee: 4) Dent g G.D ) B.D G G9.o 
&) ) g.5 "4 9.1 B %.8 g G.D ) B.D G 9.0 
4 ) G.B ") G.D Y) B.D G B.D ) B.D G @. 
a 7) 0.2 g GB. G° 8. g 0.2 g B.D G B.D 
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